Description

[0001] The present invention relates to thermostable DNA polymerases from hyperthermophilic archaeal Pyrodictium
species and means for isolating and producing the enzymes. Thermostable DNA polymerases are useful in many
recombinant DNA techniques, especially nucleic acid amplification by the polymerase chain reaction (PCR).

[0002] Extensive research has been conducted on the isolation of DNA polymerases from mesophilic microorganisms
such as E. coli. See, for example, Bessman et al., 1957, J. Biol. Chem. 223:171-177, and Buttin and Kornberg,
1966, J. Biol. Chem. 241:5419-5427.

[0003] Interest in DNA polymerases from thermophilic microbes increased with the invention of nucleic acid
amplification processes. The use of thermostable enzymes, such as those described in U.S. Patent No. 4,165,188, to
amplify existing nucleic acid sequences in amounts that are large compared to the amount initially present was
described in United States Patent Nos. 4,683,195 and 4,683,202, which describe the PCR process. These patents are
incorporated herein by reference. The PCR process involves denaturation of a target nucleic acid, hybridization of
primers, and synthesis of complementary strands catalyzed by a DNA polymerase. The extension product of each primer
becomes a template for the production of the desired nucleic acid sequence. These patents disclose that, if the
polymerase employed is a thermostable enzyme, then polymerase need not be added after every denaturation step,
because heat will not destroy the polymerase activity.
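
Because the extension product of each primer serves as a template in later cycles, the amount of target can grow geometrically with cycle number. The sketch below is an idealized model and is not taken from the cited patents; the function name and the `efficiency` parameter (the fraction of templates copied in each cycle) are assumptions introduced here for illustration.

```python
def copies_after_cycles(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized PCR yield: each cycle multiplies the template count by (1 + efficiency)."""
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    # One starting molecule, 30 cycles, assuming perfect doubling in every cycle.
    print(copies_after_cycles(1, 30))
```

Real reactions fall short of perfect doubling, so an efficiency below 1.0 is usually the more realistic assumption.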

[0004] The thermostable DNA polymerase from Thermus aquaticus (Taq) has been cloned, expressed, and purified
from recombinant cells as described in Lawyer et al., 1989, J. Biol. Chem. 264:6427-6437, and U.S. Patent Nos.
4,889,818 and 5,079,352, which are incorporated herein by reference. Crude preparations of a DNA polymerase
activity isolated from T. aquaticus have been described by others (Chien et al., 1976, J. Bacteriol. 127:1550-1557, and
Kaledin et al., 1980, Biokhimiya 45:644-651).

[0005] U.S. Patent No. 4,889,818, European Patent Application, Publication No. 258,017, and PCT Publication No.
WO 89/06691, the disclosures of which are incorporated herein by reference, all describe the isolation and recombinant
expression of an ~94 kDa thermostable DNA polymerase from Thermus aquaticus and the use of that polymerase in
PCR. Although T. aquaticus DNA polymerase is especially preferred for use in PCR and other recombinant DNA
techniques, a number of other thermophilic DNA polymerases have been purified, cloned, and expressed. (See copending,
commonly assigned PCT Publication Nos. WO 91/09950, WO 92/03556, WO 92/06200, and WO 92/06202, which are
incorporated herein by reference.)

[0006] Thermostable DNA polymerases are not irreversibly inactivated even when heated to 93-95°C for brief
periods of time, as, for example, in the practice of DNA amplification by PCR. In contrast, at this elevated temperature E.
coli DNA Pol I is inactivated.

[0007] Archaeal hyperthermophiles, such as Pyrodictium and Methanopyrus species, grow at temperatures up to
about 110°C and are unable to grow below 80°C (see Stetter et al., 1990, FEMS Microbiology Reviews 75:117-124,
which is incorporated herein by reference). These sulfur-reducing, strict anaerobes are isolated from submarine
environments. For example, P. abyssi was isolated from a deep sea active "smoker" chimney off Guaymas, Mexico, at 2,000
meters depth and in 320°C of venting water (Pley et al., 1991, Systematic and Applied Microbiology 14:245). In contrast
to the Pyrodictium species, other thermophilic microorganisms having an optimum growth temperature at or about 90°C
and a maximum growth temperature at or about 100°C are not difficult to culture. For example, a gene encoding DNA
polymerase has been cloned and sequenced from Thermococcus litoralis (European Patent Application, Publication
No. 455,430).

[0008] In contrast, culture of the extreme hyperthermophilic microorganisms is made difficult by their inability to
grow on agar-solidified media. Individual cells of the Pyrodictium species are extremely fragile, and the organisms grow
as fibrous networks. Standard bacterial fermentation techniques are extremely difficult to apply to Pyrodictium species
due to the fragility of the cells and the tendency of the cells to grow as networks that clog the steel parts of conventional
fermentation apparatus. (See Staley, J.T. et al., eds., Bergey's Manual of Systematic Bacteriology, 1989, Williams and
Wilkins, Baltimore, which is incorporated herein by reference.) These difficulties preclude laboratory culture for
preparing large amounts of purified nucleic acid polymerase enzymes for characterization and amino acid sequence analysis.
Those skilled in the art may be able to culture Pyrodictium to a cell density approaching 1 0^-1 0'' cells/ml (see, for
example, Phipps et al., 1991, EMBO J. 10(7):1711-1722). In contrast, E. coli is routinely grown to 0.3 - 1.0 x 10^^ cells/ml.
[0009] Accordingly, there is a need for further characterizing these hyperthermophile DNA polymerase enzymes,
e.g., by determining their amino acid sequence and the DNA sequence encoding it. By cloning and expressing the gene
in a suitable host organism, the prior difficulties associated with the cultivation of the native host can be avoided. In
addition, there is a desire in the art to produce thermostable DNA polymerases having enhanced thermostability that may
be used to improve the PCR process and to improve the results obtained when using a thermostable DNA polymerase
in other recombinant techniques such as DNA sequencing, nick-translation, and reverse transcription.
[0010] The present invention meets these needs by providing DNA and amino acid sequence information,
recombinant expression vectors, and purification protocols for DNA polymerases from Pyrodictium species.
[0011] The present invention provides thermostable enzymes that catalyze the combination of nucleoside
triphosphates to form a nucleic acid strand complementary to a nucleic acid template strand. The enzymes are DNA
polymerases from Pyrodictium species. In a preferred embodiment, the enzyme is from P. occultum or P. abyssi. This material
may be used in a temperature-cycling amplification reaction wherein nucleic acid sequences are produced from a given
nucleic acid sequence in amounts that are large compared to the amount initially present so that the sequences can be
manipulated and/or analyzed easily.

[0012] The genes encoding the P. occultum and P. abyssi DNA polymerase enzymes have also been identified and
cloned and provide yet another means to prepare the thermostable enzyme of the present invention. In addition, DNA
and amino acid sequences of the genes encoding the P. occultum and P. abyssi enzymes, and derivatives of these genes
encoding P. occultum and P. abyssi DNA polymerase activity, are also provided. In addition, modified genes encoding
and expressing 3'-5' exonuclease-deficient forms of Pyrodictium occultum and P. abyssi DNA polymerase activity are
also provided.

[0013] The invention also encompasses stable enzyme compositions comprising a purified, thermostable P. occultum
and/or P. abyssi enzyme as described above in a buffer containing one or more non-ionic polymeric detergents.
[0014] Finally, the invention provides a method of purification for the thermostable polymerase of the invention.
[0015] Thus, the present invention provides DNA sequences and expression vectors that encode Pyrodictium DNA
polymerase. To facilitate understanding of the invention, a number of terms are defined below.

[0016] The terms "cell," "cell line," and "cell culture" can be used interchangeably and all such designations include
progeny. Thus, the words "transformants" or "transformed cells" include the primary transformed cell and cultures
derived from that cell without regard to the number of transfers. All progeny may not be precisely identical in DNA
content, due to deliberate or inadvertent mutations. Mutant progeny that have the same functionality as screened for in the
originally transformed cell are included in the definition of transformants.

[0017] The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked
coding sequence in a particular host organism. The control sequences that are suitable for procaryotes, for example,
include a promoter, optionally an operator sequence, a ribosome binding site, and possibly other sequences. Eucaryotic
cells are known to utilize promoters, polyadenylation signals, and enhancers.

[0018] The term "expression system" refers to DNA sequences containing a desired coding sequence and control
sequences in operable linkage, so that hosts transformed with these sequences are capable of producing the encoded
proteins. To effect transformation, the expression system may be included on a vector; however, the relevant DNA may
also be integrated into the host chromosome.

[0019] The term "gene" refers to a DNA sequence that comprises control and coding sequences necessary for the 
production of a recoverable bioactive polypeptide or precursor. The polypeptide can be encoded by a full length gene 
sequence or by any portion of the coding sequence so long as the enzymatic activity is retained. 
[0020] The term "operably linked" refers to the positioning of the coding sequence such that control sequences will
function to drive expression of the protein encoded by the coding sequence. Thus, a coding sequence "operably linked"
to expression control sequences refers to a configuration wherein the coding sequences can be expressed under the
direction of a control sequence.

[0021] The term "mixture" as it relates to mixtures containing Pyrodictium polymerase refers to a collection of
materials which includes Pyrodictium polymerase but which can also include other proteins. If the Pyrodictium polymerase
is derived from recombinant host cells, the other proteins will ordinarily be those associated with the host. Where the
host is bacterial, the contaminating proteins will, of course, be bacterial proteins.

[0022] The term "non-ionic polymeric detergents" refers to surface-active agents that have no ionic charge and that
are characterized, for purposes of this invention, by an ability to stabilize the Pyrodictium enzyme at a pH range of from
about 3.5 to about 9.5, preferably at a pH range from 4.0 to 9.0.
[0023] The term "oligonucleotide" as used herein is defined as a molecule comprised of two or more
deoxyribonucleotides or ribonucleotides, preferably more than three, and usually more than ten. The exact size of an oligonucleotide
will depend on many factors, including the ultimate function or use of the oligonucleotide.

[0024] Oligonucleotides can be prepared by any suitable method, including, for example, cloning and restriction of
appropriate sequences and direct chemical synthesis by a method such as the phosphotriester method of Narang et al.,
1979, Meth. Enzymol. 68:90-99; the phosphodiester method of Brown et al., 1979, Meth. Enzymol. 68:109-151; the
diethylphosphoramidite method of Beaucage et al., 1981, Tetrahedron Lett. 22:1859-1862; the triester method of
Matteucci et al., 1981, J. Am. Chem. Soc. 103:3185-3191, or automated synthesis methods; and the solid support method
of U.S. Patent No. 4,458,066.

[0025] The term "primer" as used herein refers to an oligonucleotide, whether natural or synthetic, which is capable
of acting as a point of initiation of synthesis when placed under conditions in which primer extension is initiated.
Synthesis of a primer extension product which is complementary to a nucleic acid strand is initiated in the presence of
nucleoside triphosphates and a DNA polymerase or reverse transcriptase enzyme in an appropriate buffer at a suitable
temperature. A "buffer" includes cofactors (such as divalent metal ions) and salt (to provide the appropriate ionic
strength), adjusted to the desired pH. For Pyrodictium polymerases, the buffer preferably contains 1 to 3 mM of a
magnesium salt, preferably MgCl2, 50 to 200 µM of each nucleotide, and 0.2 to 1 µM of each primer, along with 10-100 mM
KCl, 10 mM Tris buffer (pH 7.5-8.5), and 100 µg/ml gelatin (although gelatin is not required, and should be avoided in
some applications, such as DNA sequencing).
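
The preferred buffer ranges can be written down as a small lookup table so that a planned reaction mix can be checked against them. The sketch below is only an illustration: the dictionary keys and the function name are hypothetical, the primer range is read as micromolar, and only components whose units are unambiguous in the text are included.

```python
# Preferred ranges quoted in the paragraph above. The keys are hypothetical
# names chosen for this sketch; the primer range is read as micromolar.
PREFERRED_RANGES = {
    "magnesium_salt_mM": (1.0, 3.0),
    "each_primer_uM": (0.2, 1.0),
    "KCl_mM": (10.0, 100.0),
    "tris_pH": (7.5, 8.5),
}


def components_out_of_range(recipe: dict) -> list:
    """Return the names of recipe components that fall outside the preferred ranges.

    Components missing from the recipe are skipped rather than flagged.
    """
    flagged = []
    for name, (low, high) in PREFERRED_RANGES.items():
        if name in recipe and not low <= recipe[name] <= high:
            flagged.append(name)
    return sorted(flagged)


if __name__ == "__main__":
    planned = {"magnesium_salt_mM": 2.0, "each_primer_uM": 0.5, "KCl_mM": 50.0, "tris_pH": 8.0}
    print(components_out_of_range(planned))
```

A range check of this kind only confirms that a recipe sits inside the stated preferences; it says nothing about whether a particular template and primer pair will amplify well.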

[0026] A primer is preferably a single-stranded oligodeoxyribonucleotide. The appropriate length of a primer
depends on the intended use of the primer but typically ranges from 15 to 35 nucleotides. Short primer molecules
generally require cooler temperatures to form sufficiently stable hybrid complexes with the template. A primer need not
reflect the exact sequence of the template but must be sufficiently complementary to hybridize with a template.
[0027] The term "primer" may refer to more than one primer, particularly in the case where there is some ambiguity
in the information regarding one or both ends of the target region to be amplified. For instance, if a nucleic acid
sequence is inferred from a protein sequence, a "primer" is actually a collection of primer oligonucleotides containing
sequences representing all possible codon variations based on the degeneracy of the genetic code. One of the primers
in this collection will be homologous with the end of the target sequence. Likewise, if a "conserved" region shows
significant levels of polymorphism in a population, mixtures of primers can be prepared that will amplify adjacent
sequences.
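
The idea of a primer as a collection of oligonucleotides covering all codon variations can be sketched in a few lines. The example below is an illustration under stated assumptions, not a procedure from this specification: it uses the standard genetic code, one-letter amino acid symbols, and a hypothetical function name, and it enumerates coding-strand sequences only.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid (one-letter symbol) to the list of codons that encode it.
CODONS_FOR = {}
for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS):
    CODONS_FOR.setdefault(amino_acid, []).append("".join(codon))


def degenerate_primers(peptide: str) -> list:
    """Enumerate every DNA sequence that encodes the given one-letter peptide."""
    choices = [CODONS_FOR[residue] for residue in peptide.upper()]
    return ["".join(codons) for codons in product(*choices)]


if __name__ == "__main__":
    # Met-Pro-Glu: 1 x 4 x 2 codon choices give a pool of 8 sequences.
    for primer in degenerate_primers("MPE"):
        print(primer)
```

The pool size is the product of the codon counts at each position, which is why stretches rich in single-codon residues such as Met and Trp keep the collection small.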

[0028] A primer may be "substantially" complementary to a strand of specific sequence of the template. A primer
must be sufficiently complementary to hybridize with a template strand for primer elongation to occur. A primer
sequence need not reflect the exact sequence of the template. For example, a non-complementary nucleotide fragment
may be attached to the 5' end of the primer, with the remainder of the primer sequence being substantially
complementary to the strand. Non-complementary bases or longer sequences can be interspersed into the primer, provided that the
primer sequence has sufficient complementarity with the sequence of the template to hybridize and thereby form a
template primer complex for synthesis of the extension product of the primer.

[0029] A primer can be labeled, if desired, by incorporating a label detectable by spectroscopic, photochemical,
biochemical, immunochemical, or chemical means. For example, useful labels include 32P, fluorescent dyes,
electron-dense reagents, enzymes (as commonly used in ELISAs), biotin, or haptens and proteins for which antisera or
monoclonal antibodies are available. A label can also be used to "capture" the primer, so as to facilitate the immobilization of
either the primer or a primer extension product, such as amplified DNA, on a solid support.

[0030] The terms "restriction endonucleases" and "restriction enzymes" refer to bacterial enzymes which cut dou- 
ble-stranded DNA at or near a specific nucleotide sequence. 

[0031] The terms "thermostable polymerase" and "thermostable enzyme" refer to an enzyme which is stable to
heat and is heat resistant and catalyzes combination of the nucleotides in the proper manner to form primer extension
products that are complementary to a template nucleic acid strand. Generally, synthesis of a primer extension product
begins at the 3' end of the primer and proceeds in the 5' direction along the template strand, until synthesis terminates.
[0032] The Pyrodictium thermostable enzymes of the present invention satisfy the requirements for effective use in
the amplification reaction known as the polymerase chain reaction or PCR as described in U.S. Patent No. 4,965,188
(incorporated herein by reference). The Pyrodictium enzymes do not become irreversibly denatured (inactivated) when
subjected to the elevated temperatures for the time necessary to effect denaturation of double-stranded nucleic acids,
a key step in the PCR process. Irreversible denaturation for purposes herein refers to permanent and complete loss of
enzymatic activity. The heating conditions necessary for nucleic acid denaturation will depend, e.g., on the buffer salt
concentration and the composition and length of the nucleic acids being denatured, but typically range from about 90°C
to about 105°C for a time depending mainly on the temperature and the nucleic acid length, typically from a few seconds
up to four minutes.

[0033] Higher temperatures may be required as the buffer salt concentration and/or GC composition of the nucleic
acid is increased. The Pyrodictium enzymes do not become irreversibly denatured from relatively short exposures to
temperatures of about 95°C-100°C. The extreme thermostability of the Pyrodictium DNA polymerase enzymes
provides additional advantages over previously characterized thermostable enzymes. Prior to the present invention,
efficient PCR at denaturation temperatures as high as 100°C had not been demonstrated. No thermostable DNA
polymerases have been described up to now for this purpose. However, as the G/C content of a target nucleic acid
increases, the temperature necessary to denature (Tden) the duplex also increases. For target sequences that require
a Tden step of over 95°C, previous protocols require that solvents are included in the PCR for partially destabilizing the
duplex, thus lowering the effective Tden. Agents such as glycerol, DMSO, or formamide have been used in this manner
in PCR (Korge et al., 1992, Proc. Natl. Acad. Sci. USA 89:910-914, and Wong et al., 1991, Nuc. Acids Res. 19:2251-2259,
incorporated herein by reference). These agents, in addition to destabilizing duplex DNA, will affect primer
stability and can inhibit enzyme activity, and varying concentrations of DMSO or formamide decrease the thermoresistance (i.e.,
half-life) of thermophilic DNA polymerases. Accordingly, a significant number of optimization experiments and reaction
conditions need to be evaluated when utilizing these cosolvents. In contrast, simply raising the Tden to 100°C with Poc
or Pab DNA polymerase in an otherwise standard PCR can facilitate complete strand separation of PCR product,
eliminating the need for DNA helix destabilizing agents.
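
Since the denaturation temperature rises with G/C content, a quick look at the G/C fraction of a target can indicate whether a higher denaturation step is worth considering. The helper below is a minimal sketch with a hypothetical name; it only counts bases and does not predict a denaturation temperature, which also depends on salt concentration and duplex length.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence (case-insensitive)."""
    seq = "".join(sequence.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First 30 nucleotides of SEQ ID No. 1 as printed in the sequence listing.
    print(round(gc_fraction("ATGCCAGAAGCTATAGAGTTCGTGCTCCTT"), 3))
```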
[0034] The extreme hyperthermophilic polymerases disclosed herein are stable at temperatures exceeding 100°C,
and even as high as 110°C. However, at these temperatures, depending on the pH and ionic strength, the integrity of
the target DNA may be adversely affected (Eckert and Kunkel, 1992, In PCR: A Practical Approach, eds. McPherson,
Quirke and Taylor, Oxford University Press, pages 225-244, incorporated herein by reference).

[0035] The Pyrodictium DNA polymerase has an optimum temperature at which it functions that is higher than about
45°C. Temperatures below 45°C facilitate hybridization of primer to template, but depending on salt composition and
concentration and primer composition and length, hybridization of primer to template can occur at higher temperatures
(e.g., 45-70°C), which may promote specificity of the primer hybridization reaction. The enzymes of the invention exhibit
activity over a broad temperature range up to 85°C. The optimal activity is template dependent and generally in the
range of 70-80°C.

[0036] The present invention provides DNA sequences encoding the thermostable DNA polymerase activity of
Pyrodictium species. The preferred embodiments of the invention provide the nucleic acid and amino acid sequences
for P. abyssi and P. occultum DNA polymerase. The entire P. abyssi and P. occultum DNA polymerase coding
sequences are depicted below as SEQ ID No. 1 (P. abyssi) and SEQ ID No. 3 (P. occultum). The deduced amino acid
sequences are listed as SEQ ID No. 2 (P. abyssi) and SEQ ID No. 4 (P. occultum). For convenience, the nucleotide and
amino acid sequences of these polymerases are numbered for reference.
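
The relationship between a coding sequence and its deduced amino acid sequence can be checked mechanically. The sketch below assumes the standard genetic code and uses hypothetical names; it translates a DNA string into the three-letter notation used in the listing and is shown on the first ten codons of SEQ ID No. 1.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# One-letter to three-letter amino acid symbols; "*" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "*",
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence into three-letter amino acid codes."""
    seq = "".join(dna.split()).upper()   # ignore spaces and line breaks
    usable = len(seq) - len(seq) % 3     # drop any trailing partial codon
    return "".join(THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(translate("ATGCCAGAAGCTATAGAGTTCGTGCTCCTT"))
```

Running the same function over each 60-nucleotide row is one way to cross-check a printed listing against its amino acid lines.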

[0037] The present invention provides nucleic acid sequences providing means for comparison of P. occultum and
P. abyssi DNA polymerase sequences with other thermostable polymerase enzymes. Such a comparison demonstrates
that these novel sequences are unrelated to previously described nucleic acid sequences encoding eubacterial
thermostable DNA polymerases. Consequently, methods for identifying Pyrodictium DNA polymerase enzymes based on
the published sequences of known eubacterial thermostable DNA polymerases are not suitable for isolating nucleic acid
sequences encoding Pyrodictium DNA polymerase enzymes.
P. abyssi DNA Polymerase

SEQ ID No. 1 ATGCCAGAAGCTATAGAGTTCGTGCTCCTT 
SEQ ID No. 2 MetProGluAlaIleGluPheValLeuLeu 10

31 GATTCAAGCTACGAGATTGTAGGGAAAGAGCCGGTAATCATACTATGGGGTGTAACGCTA

AspSerSerTyrGluIleValGlyLysGluProValIleIleLeuTrpGlyValThrLeu 30

91 GACGGTAAACGCATAGTCCTACTTGATAGGAGGTTTAGGCCCTACTTCTATGCACTCATA

AspGlyLysArgIleValLeuLeuAspArgArgPheArgProTyrPheTyrAlaLeuIle 50

151 TCCCGCGACTACGAAGGTAAGGCCGAGGAGGTAGTAGCTGCTATTAGAAGGCTAAGTATG 

SerArgAspTyrGluGlyLysAlaGluGluValValAlaAlalleArgArgLouSerMet 70 

211 GCAAAGAGCCCCATAATAGAAGCAAAGGTGGTTAGTAAGAAGTACTTCGGAAGGCCCCGT 

AlaLysSerProIlelleGluAlaLysVaXValSerLysLysTyrPheGlyArgProArg 90 

271 AAAGCAGTCAAAGTAACGACAGTTATACCCGAATCTGTCAGAGAATATAGAGAGGCTGTA 

LysAlaValLysValThrThrVallleProGluSerValArgGluTyrArgGluAlaVal 110 

331 AAAAAGCTGGAAGGCGTGGAAGACTCTCTAGAAGCAGACATAAGGTTCGCGATGAGGT AT 

LysLyaLeuGluGlyValGluAspSerLouGluAlaAspIleArgPheAlaMetArgTyr 130 

391 CTAATCGACAAGAAGCTCTACCCGTTCACAGCATACCGTGTCAGAGCCGAGAACGCTGGA 

LeuIleAspLysLysLeuTyrProPheThrAlaTyrArgValArgAlaGluAanAlaGly 150 

451 CGCAGCCCTGGTTTCCGTGTAGACTCGGTATACACTATAGTTGAGGACCCAGAGCCTATT 

ArgSerProGlyPheArgValA9pSerValTyrThrIleValGluAspProGluProIle 170 

511 GCCGACATAACTAGTATAGATATACCAGAGATGCGTGTGCTCGCGTTCGACATAGAGGTC 

AlaAspIleThrSerXleAspIleProGluMetArgValLexiAlaPheAspIleGluVal 190 

571 T ACAGT AAGAGAGGAAGCCCTAACCCGTCCCGCGACCCGGTCATAATAATCTCGATAAAG 

TyrSerLysArgGlySerProAsnProSerArgAapProValllellelleSerlleLya 210 

631 GACAGCAAGGGGAACGAGAAGCTACTAGAAGCCAATAACTACGACGACAGAAACGTGCTA 

AspSerLysGlyAsnGluLysLeuLeuGluAlaAsnAsnTyrAspAspArgAsnValLeu 230 

691 CGGGAATTTATAGAGTACATACGCTCCTTTGACCCAGACATAATAGTAGGCTACAATAGC 

ArgGluPhelleGluTyrlleArgSerPheAspProAapXlelleValGlyTyrAsnSer 250 

7 51 AACAATTTTGACTGGCCATACCTTATAGAACGTGCACACAGAATAGGAGTAAAGCTCGAC 

AsnAsnPheAspTrpProTyrLeuIleGluArgAlaHisArglleGlyValLysLeuAsp 27 0 

811 GTGACAAGGCGTGTTGGCGCAGAGCCAAGTATGAGCGTCTATGGACATGTCTCAGTGCAG 

ValThrArgArgValGlyAlaGluProSerMetSerValTyrGlyHisValSerValGln 290 

871 GOT AGGCTAAACGT AGACCTCTACAACTACGTGGAGGAAATGCATGAGATAAAGGTAAAG 

GlyArgLeuAsnValAapLeuTyrAanTyrValGluGluMetHisGluIleLyaValLys 310 
931 ACGCTCGAGGAGGTCGCCGAATACCTAGGCGTTATGCGCAAGAGCGAGCGCGTACTAATA

ThrLeuGluGluValAlaGluTyrLeuGlyValMetArgLysSerGluArgValLeuIle 330

991 GAATGGTGGCGGATCCCAGATTACTGGGACGACGAGAAGAAACGGCCGCTACTGAAGCGT 

GluTrpTrpArglleProAapTyrTrpAspAspGluLysLysArgProLeuLeuLyaArg 350 

1051 TATGCCCTCGACGATGTGAGAGCCACCTACGGCCTCGCCGAGAAGATACTCCCATTCGCA 

TyrAlaLeuAspAapValArgAlaThrTyrGlyLeuAlaGluLysIleLeuProPheAla 370 

1 1 1 1 ATACAGCTTTCGACAGTAACCGGTGTTCCTTTAGACCAAGTCGGGGCTATGGGCGTAGGT 

UeGlnLeuSerThrValThrGlyValProLeuAspGlnValGlyAlaMetGlyValGly 390 

1171 TTCCGTCTAGAATGGTACCTTATGAGAGCAGCGCATGATATGAACGAGCTTGTCX:CCAAC 

PheArgLeuGluTrpTyrLeuMetArgAlaAlaHiaAspMetAanGluLeuValProAsn 410 

1231 CGTGTCAAGCGGCGCGAAGAGAGCTACAAGGGAGCAGT AGTACTAAAGCCCCTAAAGGGT 

ArgValLysArgArgGluGluSerTyrLysGlyAlaValValLouLysProLeuLysGly 430 

1291 GTCCATGAGAACGTAGTAGTGCTCGACTTTAGCTCAATGTACCCCAACATAATGATAAAG 

ValHisGluAanValValValLeuAapPheSerSerMetTyrProAanlleMetlleLys 450 

1351 TACAATGTGGGCCCTGACAC6ATAATTGACGACCCCTCAGAGTGCGAGAAGT ACAGTGGA 

TyrAsnValGlyProAapThrllelleAapAapProSerGluCysGluLysTyrSerGly 470 

1411 TGCTACGTAGCCCCCGAAGTCGGGCACATGTTTAGGCGCTCGCCCTCCGGCTTCTTrAAG 

CysTyrValAlaProGluValGlyHlaMetPheArgArgSerProSerGlyPhePheLys 490 

1471 ACCGTGCTTGAGAACCTCATAGCGCTGCGTAAGCAAGTACGTGAAAAGATGAAGGAGTTC 

ThrValLcuGluAsnLeuIleAlaLeuArgLysGlnValArgGluLysMetLysGluPhe 510 

1531 CCCCCAGATAGCCCAGAATACCGGATATACGATGAACGCCAGAAGGCACTCAAGGTGCTA 

ProProAspSerProGluTyrArglleTyrAapGluArgGlnLysAlaLeuLyaValLeu 530 

1591 GCCAACGCTAGCTACGGCTACATGGGATGGGTGCACGCTCGCTGGTACTGTAAACGCTGC 

AlaAsoAlaSerTyrGlyTyrMetGlyTrpValHisAlaArgTrpTyrCysLysArgCys 550 

1651 GCAGAGGCTGTAACAGCCTGGGGCCGTAACCTGATACTCTCAGCAATAGAATATGCTAGG 

AlaGluAlaValThrAlaTrpGlyArgA3nLeuIleLeuS«rAlaIleGluTyrAlaArg 570 

1711 AAGCTCGGCCTCAAAGTAATATACGGAGACACGGACTCCCTATTCGTAACCTATGATATC 

LysLeuGlyLeuLyaVallleTyrGlyAapThrAspSerLeuPheValThrTyrAapIle 590 

1771 GAGAAGGTAAAGAAGCTAATAGAATTCGTCGAGAAACAGCTAGGCTTCGAGATAAAGATA 

GluLysValLysLysLeuIleGluPheValGluLysGlnLeuGlyPheGluIleLysIle 610 

1831 GACAAGGTATACAAAAGAGTGTTCTTTACCGAGGCAAAGAAGCGCTACGTGGGCCTCCTC 

AspLyaValTyrLysArgValPhePheThrGluAlaLysLysArgTyrValGlyLeuLeu 630 

1891 GAGGACGGGCGTATGGACATAGTAGGCTTTGAGGCTGTTAGAGGCGACTGGTGTGAGCTA 

GluAspGlyArgMetAspIleValGlyPheGluAlaValArgGlyAapTrpCysGluLeu 650 

1951 GCTAAAGAGGTGCAAGAGAAAGTAGCAGAGATAATACTGAAGACGGGAGACATAAATAGA 

AlaLysGluValGlnGluLyaValAlaGluIlelleLeuLysThrGlyAspIleAsnArg 670 

2 011 GCCATAAGCTACATAAGAGAGGTCGTGAGAAAGCTAAGAGAAGGCAAGATACCCATAACA 

AlaIleSerTyrIleArgGluValValArgLy3L«uArgGluGlyLy3lleProIleThr 690 

2071 AAGCTCGTAATATGGAAGACCTTGACAAAGAGAATCGAGGAATACGAGCACGAGGCGCCG 

LyaLeuVallleTrpLysThrLeuThrLysArglleGluGluTyrGluHlsGluAlaPro 710 

2131 CACGTTACTGCAGCACGGCGTATGAAAGAAGCAGGCTACGATGTGGCACCGGGAGACAAG 

HiaValThrAlaAlaArgArgMetLysGluAlaGlyTyrAspValAlaProGlyAspLys 730 
2191 ATAGGCTACATCATAGTTAAAGGACATGGCAGTATATCGAGTCGTGCCTACCCGTACTTT

IleGlyTyrIleIleValLysGlyHisGlySerIleSerSerArgAlaTyrProTyrPhe 750

2251 ATGGTAGACTCGTCTAAGGTTGACACACiAGT ACT ACATAGACCACCAGATAGTACCAGCA 

MotValAspSerSerLysValAapThrGluTyrTyrlleAspHisGlnlleValProAla 770 

2311 GCAATGAGGATACTCTCATACTTCGGGGTCACAGAGAAGCAGCTTAAGGCAGCATCATCT 

AlaMetArgIleLeuS«rTyrPheGlyValThrGluLy3GlnL«uLy»AlaAlaSerSftr 790 

2371 GGGCATAGGAGTCTCTTCGACTTCTTCGCGGCAAAGAAGTAGccccggctctCCaaacta 

GlyHisArgSerLeuPhGAapPhePheAlaAlaLyaLya * 803 

P. occultum DNA Polymerase 

SEQ ID No. 3 ATGACAGAGACTATAGAGTTCGTGCTGCTA 
SEQ ID No. 4 MetThrGluThrIleGluPheValLeuLeu 10

31 GACTCTAGCTACGAGATACTGGGGAAGGAGCCGGTAGTAATCCTCTGGGGGATAACGCTT

AspSerSerTyrGluIleLeuGlyLysGluProValValIleLeuTrpGlyIleThrLeu 30

9 1 GACGGTAAACGTGTCGTGCTTCTAGACCACCGCTTCCGCCCCT ACTTCTACGCCCTCATA 

AspGlyLy»ArgValValL©uLeuA3pHi3ArgPh«ArgProTyrPheTyrAlaLeuIl« 50 

151 GCCCGGGGCTATGAGGATATGGTGGAGGAGATAGCAGCTTCXATAAGGAGGCTTAGTGTG 

AlaArgGlyTyrGluAspMetValGluGluIleAlaAlaSerlXeArgArgLouSerVal 70 

211 GTCAAGAGTCCGATAATAGATGCCAAGCCTCTTGATAAGAGGTACTTCGGCAGGCCCCGT 

ValLysSerProIlelleAapAlaLysProLeuAapLysArgTyrPbeGlyArgProArg 90 

271 AAGGCGGTGAAGATTACCACTATGATACCCGAGTCTGTTAGACACTACCGCGAGGCGGTG 

Ly sAl aVa 1 Ly 3 1 leThrThrMet 1 1 eProGluS«rValArgHisTy rArgGluAl aVal 110 

331 AAGAAGATAGAGGGTGTGGAGGACTCCCTCGAGGCAGAT ATAAGGTTTGCAATGAGATAT 

LysLyalleGluGlyValGluAspS^xLeuGluAlaAapIloArgPheAlaMotAxgTyr 130 

391 CTGATAGATAAGAGGCTCTACCCGTTCACGGTTTACCGGATCCCCGTAGAGGATGCGGGC 

LeuIleAspLysArgLeuTyrProPheThrValTyrArglleProValGauAspAlaGly 150 

451 CGCAATCCAGGCTTCCGTGTTGACCGTGTCTACAAGGTTGCTGGCGACCCGGAGCCCCTA 

ArgAanProGlyPheArgValAspArgValTyrLyaValAlaGlyAapProGluProLeu 170 

5 1 1 GCGGATATAACGCGGATCGACXTTCCCCCGATGAGGCTGGTAGCTTTTGATATAGAGGTG 

AlaAspIleThrArglleAspLeuProProMetArgLeuValAlaPhoAapIleGluVal 190 

571 TATAGCAGGAGGGGGAGCCCTAACCCTGCAAGGGATCCAGTGATAATAGTGTCGCTGAGG 

TyrSerArgArgGlySerProAanProAlaArgAspProValllelloValSerLeuArg 210 

631 GACAGCGAGGGCAAGGAGAGGCT^TAGAAGCTGAAGGCCATGACGACAGGAGGGTTCTG 

AapSerGluGlyLysGluArgLeuIleGluAlaGluGlyHisAspAspArgArgValLeu 230 

691 AGGGAGTTCGTAGAGTACGTGAGAGCCTTCGACCCCGACATAATAGTGGGCTATAACAGT 

ArgGluPheValGluTyrValArgAlaPheAspProAspIlelleValGlyTyrAsnSer 250 

7 51 AACCACTTCGACTGGCCCTACCTAATGGAGCGCGCCCGTAGGCTCGGGATTAACCTCGAC 

AsnHisPheAapTrpProTyrLeuMetGluArgAlaArgArgLouGlylleAanLeuAsp 270 

811 GTTACACGCCGTGTGGGGGCAGAGCCCACCACCAGCGTCTACGGCCACGTCTCGGTGCAG 

ValThrArgArgValGlyAlaGluProThrThrSerValTyrGlyHisValSerValGln 290 

871 GGT AGGCTGAACGTGGACCTCT ACGACTATGCCGAGGAGATGCCGGAGATAAAGATGAAG 

GlyArqLeuAsnValAspLeuTyrAspTyrAlaGluGluMetProGluIleLysHetLys 310 

931 ACGCTTGAGGAGGTAGCGGAGT ACCTAGGCGTTATGAAGAAGAGCGAGCGTGTGATAAT A 

ThrLeuGluGluValAlaGluTyrLeuGlyValMetLysLysSerGluArgValllelle 330 
991 GAGTGGTGGAGGATACCCGAGTACTGGGATGACGAGAAGAAGAGGCAGCTGCTAGAGCGC

GluTrpTrpArgIleProGluTyrTrpAspAspGluLysLysArgGlnLeuLeuGluArg 350

1051 TACGCGCTCGACGATGTGAGGGCTACCTACGGCCTCGCGGAAAAGArGCTACCGTTCGCC 

TyrAlaLeuA3pA5pValArgAlaThrTyrGlyLeuAlaGluLysMetL«uProPheAla 370 

1111 ATACAGCTCTCCACTGTTACGGGTGTGCCTCTCGACCAGGTAGGTGCTATGGGCGTAGGC 

I leGlnLeuSerThrValThrGlyValProLeuAspGlnValGlyAlaMetGlyValGly 390 

1171 TTCCGCCTAGAGTGGTATCTCATGCGTGCAGCCTACGATATGAACGAGCTGGTGCCGAAC 

PheArgLeuGluTrpTyrLeuMetArgAlaAlaTyrAapMetAanGluLeuValProAan 410 

1231 CGGGTGGAGAGGAGGGGGGAGAGCTACAAGGGTGCAGTAGTGTTAAAGCCTCTCAAGGGA 

ArgValGluArgArgGlyGluSarTyrLyaGlyAlaValValLeuLyaProLeuLyaGly 430 

1291 GTCCATGAGAATGTTGTGGTGCTCGATTTCAGTTCCATGT ACCCGAGCATAATGATAAAG 

ValHisGluA3nValValValLe\iA3pPheSorSerMetTyrProS«rIleMetIleLy3 450 

1351 TACAACGTGGGCCCCGACACTATAGTCGACGACCCCTCGGAGTGCCCAAAGTACGGCGGC 

TyrAsnValGlyProAflpThrlleValAspAapProSerGluCysProLyaTyrGlyGly 470 

1411 TGCTATGTAGCCCCCGAGGTCGGGCACCGGTTCCGTCGCTCCCCGCCAGGCTTCTTCAAG 

CyaTyrValAlaProGluValGlyHlsArgPheArgArgSerProProGlyPhePheLya 490 

1471 ACCGTGCTCGAGAACCTACTGAAGCTACGCCGACAGGTAAAGGAGAAGATGAAGGAGTTT 

ThrValLeuGluAsnLeuLeuLyaLeuArgArgGXnValLysGluLyaMetLysGluPhe 510 

1531 CCGCCTGACAGCCCCGAGTACAGGCTCTACGAlXiAGCGCCAGAAGGCGCTCAAGGTTC^ 

ProProAspSerProGluTyrArgLeuTyrAspGluArgGlnLyaAlaLeuLy aval Leu 530 

1591 GCGAACGCGAGCTATGGCTACATGGGGTGGAGCCATGCCCGCTGGTACTGCAAACGCTGC 

AlaAsnAlaSerTyrGlyTyrMetGlyTrpSerHiaAlaArgTrpTyrCyaLysArgCy© 550 

1651 GCCGAGGCTGTCACAGCCTGGGGCCGTAACCTTATACTGACAGCTATCGAGTATGCCAGG 

AlaGluAlaVairhrAlaTtpGlyArgAanLouIlaLeuThrAlalloGluTyrAlaArg 57 0 

1711 AAGCTCGGCCTAAAGGTTATATATGGAGACACCGACTCCCTCTTCGTGGTCTATGACAAG 

LyaLeuGlyLeuLyaVallleTyrGlyAapThrAapSarLeuPheValValTyrAspLya 590 

1771 GAGAAGGTTGAGAAGCTGATAGAGTTTGTCGAGAAGGAGCTGGGCTTTGAGATAAAGATA 

GluLyaValGluLyaLouIleGluPheValGluLyaGluLeuGlyPheGluIleLyalle 610 

1831 GACAAGATCTACAAGAAAGTGTTCTTCACGGAGGCTAAGAAGCGCTATGTAGGTCTCCTC 

AapLysIleTyrLyaLyaValPhePheThrGluAlaLyaLyaArgTyrValGlyLeuLeu 630 

1991 GAGGACGGACGTATAGACATCGTGGGCTTTGAAGCAGTCCGCGGCGACTGGTGCGAGCTG 

GluAapGlyArglleAspIleValGlyPheGluAlaValArgGlyAapTrpCysGluLeu 650 

1951 GCT AAGGAGGTGCAGGAGAAGGCGGCTGAGAT AGTGTTGAATACGGGGAACGTGGACAAG 

AlaLyaGluValGlnGluLyaAlaAlaGluIleValLeuAanThrGlyAsnValAapLys 67 0 

2011 GCTATAAGCTACATAAGGGAGGTAATAAAGCAGCTCCGCGAGGGCAAGGTGCCAATAACA 

AlalleSerTyrlleArgGluVallleLyaGlnLeuArgGluGlyLyaValProIleThr 690 

2071 AAGCTTATCATATGGAAGACGCTGAGCAAGAGGATAGAGGAGTACGAGCATGACGCGCCT 

LysLeuIlelleTrpLyaThrLeuSerLyaArglleGluGluTyrGluHisAapAlaPro 710 

2131 CATGTGATGGCTGCACGGCGTATGAAGGAGGCAGGCTACGAGGTGTCTCCCGGCGATAAG 

HiaValMetAlaAlaArgArgMetLyaGluAlaGlyTyrGluValSerProGlyAapLya 7 30 

2191 GTGGGCTACGTCATAGTTAAGGGTAGCGGGAGTGTGTCCAGCAGGGCCTACCCCTACTTC 

ValGlyTyrVallleValLyaGlySerGiySerValSerSerArgAlaTyrProTyrPhe 750 
2251 ATGGTTGATCCATCGACCATCGACGTCAACTACTATATTGACCACCAGATAGTGCCGGCT

MetValAspProSerThrIleAspValAsnTyrTyrIleAspHisGlnIleValProAla 770

2311 GCTCTGAGGATACTCTCCTACTTCGGAGTCACCGAGAAACAGCTCAAGGCGGCGGCTACG 

AlaLeuArglleLeuSerTyrPheGlyValThrGluLysGlnLeuLysAlaAlaAlaThr 7 90 

2371 GTGCAGAGAAGCCTCTTCGACTTCTTCGCCTCAAAGAAATAGctcCtCCacccggctagc 

ValGlnArgSerLeuPheAspPhePheAlaSerLyaLys * 8 03 



[0038] As a result of the present invention, Pyrodictium DNA polymerase amino acid sequences can be used to
design novel degenerate primers to find previously undiscovered hyperthermophilic DNA polymerase genes. The
generic utility of the degenerate primer process is exemplified in PCT Publication No. WO 92/06202, which is
incorporated herein by reference. The publication describes the use of degenerate primers for cloning the gene
encoding Thermosipho africanus DNA polymerase. Prior to the present invention, degenerate priming methods were
demonstrated to be suitable for isolating genes encoding novel thermostable DNA polymerase enzymes. The success
of these methods lies in part in the identification of conserved motifs among the thermostable DNA polymerases
of, for example, Thermus aquaticus and Thermus thermophilus.
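
By way of illustration only, the back-translation step behind a degenerate primer can be sketched as follows. The sketch assumes the standard genetic code and IUPAC ambiguity symbols. The function names and the choice of the hexapeptide Tyr-Gly-Asp-Thr-Asp-Ser (which occurs in the sequence listing above) are illustrative assumptions, not the primers actually used by applicants.

```python
from itertools import product
from math import prod

BASES = "TCAG"
# Standard genetic code; codons are enumerated with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# IUPAC ambiguity codes keyed by the alphabetically sorted set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
    "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N",
}

def codons_for(aa: str) -> list[str]:
    """All codons that encode the one-letter amino acid `aa`."""
    return [codon for codon, residue in CODON_TABLE.items() if residue == aa]

def degenerate_primer(peptide: str) -> str:
    """Back-translate a peptide into one IUPAC-coded sense-strand oligonucleotide."""
    symbols = []
    for aa in peptide:
        codons = codons_for(aa)
        for pos in range(3):
            bases = "".join(sorted({codon[pos] for codon in codons}))
            symbols.append(IUPAC[bases])
    return "".join(symbols)

def degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences for the peptide (product of codon counts)."""
    return prod(len(codons_for(aa)) for aa in peptide)

# Hypothetical example: the hexapeptide Tyr-Gly-Asp-Thr-Asp-Ser.
print(degenerate_primer("YGDTDS"), degeneracy("YGDTDS"))
```

Note that position-wise IUPAC coding of six-codon residues such as Ser also admits some non-coding combinations, so the written primer is slightly more degenerate than the codon count suggests.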

[0039] Thus, due to the dissimilarity in DNA polymerase amino acid sequences between the extreme
hyperthermophiles, for example, Pyrodictium species, and non-hyperthermophiles such as Thermus species, these
degenerate priming methods were not previously suitable for isolating and expressing Pyrodictium polymerase
genes. Applicants' invention has enabled the use of degenerate priming methods for isolating genes encoding
novel DNA polymerase enzymes from extreme hyperthermophilic microbes. The gene encoding the DNA polymerase of
the hyperthermophilic T. litoralis (Tli) has been described. While Tli, Pab, and Poc DNA polymerases contain the
amino acid sequence motifs that reflect eucaryotic DNA polymerases, Pab and Poc DNA polymerases have only
limited and spotty amino acid sequence identity with Tli DNA polymerase. Specifically, amino acid sequence
alignments indicate only 37% to 39% sequence identity between Poc or Pab and Tli DNA polymerase. Significant
regions of non-identity with Tli DNA polymerase occur in the 20 amino acids that precede and the 10 amino acids
that follow Region 1 (positions 438 through 458 in SEQ ID Nos. 2 and 4). In addition, significant regions of
non-identity with Tli DNA polymerase occur in the 10 to 15 amino acids that precede, and the 10 to 15 amino
acids that follow, Region 4 (positions 611 through 634 in SEQ ID Nos. 2 and 4). These regions, as well as other
portions of the polymerase active site, are highly conserved in Poc and Pab DNA polymerases and contribute
significantly to the extraordinary thermostability of these DNA polymerase enzymes.
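
For readers unfamiliar with how a percent-identity figure of this kind is obtained, the following minimal sketch computes identity over two pre-aligned sequences. The convention used here (identical residues divided by aligned, gap-free columns) is an assumption; the specification does not state which convention produced the 37% to 39% values. The two aligned strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # columns containing a gap are not counted
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared

# Hypothetical aligned fragments, for illustration only.
print(percent_identity("YGDTDS-LF", "YADTDGGLF"))
```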

[0040] The present invention, by providing DNA and amino acid sequences for two Pyrodictium polymerase
enzymes, therefore enables the isolation of other extremely thermophilic DNA polymerase enzymes and the coding
sequences for those enzymes. Further alignment of P. occultum and P. abyssi sequences with known thermostable
enzyme sequences allows the selective identification of additional novel enzymes suitable for efficient PCR at
denaturation temperatures of 100°C.

[0041] The DNA and amino acid sequences shown above and the DNA compounds that encode those sequences
can be used to design and construct recombinant DNA expression vectors to drive expression of Pyrodictium DNA
polymerase activity in a wide variety of host cells. A DNA compound encoding all or part of the DNA sequence
shown above can also be used as a probe to identify thermostable polymerase-encoding DNA from other archaea,
especially Pyrodictium species, and the amino acid sequence shown above can be used to design peptides for use
as immunogens to prepare antibodies that can be used to identify and purify a thermostable polymerase.

[0042] Recombinant vectors that encode an amino acid sequence encoding a Pyrodictium DNA polymerase will
typically be purified prior to use in a recombinant DNA technique. The present invention provides such
purification methodology.

[0043] The molecular weight of the DNA polymerase purified from recombinant E. coli hosts that express the
P. occultum or P. abyssi polymerase gene is determined by the above method to be about 90 kDa. The molecular
weight of this same DNA polymerase, as calculated from the predicted amino acid sequence, is approximately
92.6 kilodaltons.
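
The calculation of a molecular weight from a predicted amino acid sequence can be sketched as below. This is a minimal illustration that assumes standard average residue masses and an unmodified linear chain; it is not the applicants' own calculation. The 13-residue peptide is the C-terminal stretch shown in the sequence listing above, and the function name is hypothetical.

```python
# Average residue masses in daltons (free amino acid minus one water molecule).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water is added back for the free N- and C-termini

def protein_mass_da(sequence: str) -> float:
    """Average molecular mass, in daltons, of an unmodified linear polypeptide."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER

# C-terminal residues 791-803 from the listing above (one-letter code).
tail = "VQRSLFDFFASKK"
print(f"{protein_mass_da(tail):.1f} Da for the {len(tail)}-residue peptide")

# Sanity check on the stated figure: 92.6 kDa spread over 803 residues.
print(f"{92_600 / 803:.1f} Da average per residue")
```

The per-residue average implied by 92.6 kDa over 803 residues is close to the commonly quoted mean residue mass of roughly 110 to 120 Da, which is consistent with the stated value.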

[0044] An important aspect of the present invention is the production of recombinant Pyrodictium DNA polymerase.
Thus, the present invention also provides a process for the preparation of thermostable DNA polymerases in
accordance with the present invention, which process comprises the steps of:

(a) culturing a host cell transformed with a DNA vector that comprises a DNA sequence encoding said
thermostable DNA polymerase, and

(b) isolating the thermostable DNA polymerase produced in the host cell from the culture.

[0045] As noted above, the gene encoding this enzyme has been cloned from the genomic DNA of two exemplary
Pyrodictium species, P. occultum and P. abyssi. The complete coding sequence for the P. occultum (Poc) DNA
polymerase can be easily obtained in an ~2.52 kb NheI restriction fragment of the plasmid pPoc 4. This plasmid
was deposited with the American Type Culture Collection (ATCC) in host cell E. coli Sure® Cells (Stratagene) on
May 11, 1993, under Accession No. 69309. The complete coding sequence for P. abyssi (Pab) DNA polymerase can be
easily obtained in an ~3.74 kb SalI restriction fragment of the plasmid pPab 14. This plasmid was deposited with
the ATCC in host cell E. coli Sure® Cells (Stratagene) on May 11, 1993, under Accession No. 69310.

[0046] The complete coding sequence and deduced amino acid sequence of the thermostable Pab and Poc DNA
polymerase enzymes are provided above. The entire coding sequence of the DNA polymerase gene is not required,
however, to produce a biologically active gene product with DNA polymerase activity. The availability of DNA
encoding the Pyrodictium DNA polymerase sequence provides the opportunity to modify the coding sequence so as
to generate mutein (mutant protein) forms also having DNA polymerase activity. Amino(N)-terminal deletions of
approximately one-third of the coding sequence can provide a gene product that is quite active in polymerase
assays. Because certain N-terminal shortened forms of the polymerase are active, the gene constructs used for
expression of these polymerases can include the corresponding shortened forms of the coding sequence.

[0047] In addition to the N-terminal deletions, individual amino acid residues in the peptide chain comprising
Pyrodictium polymerase may be modified by oxidation, reduction, or other derivation, and the protein may be
cleaved to obtain fragments that retain activity. Such alterations that do not destroy activity do not remove
the protein from the definition of a protein with Poc or Pab polymerase activity and so are specifically
included within the scope of the present invention. Modifications to the primary structure of the Poc or Pab
DNA polymerase gene by deletion, addition, or alteration so as to change the amino acids incorporated into the
DNA polymerase during translation can be made without destroying the high temperature DNA polymerase activity
of the protein. Such substitutions or other alterations result in the production of proteins having an amino
acid sequence encoded by DNA falling within the contemplated scope of the present invention. Likewise, the
cloned genomic sequence, or homologous synthetic sequences, of the Poc and Pab DNA polymerase genes can be used
to express fusion polypeptides with Pyrodictium DNA polymerase activity or to express a protein with an amino
acid sequence identical to that of native Poc or Pab DNA polymerase.

[0048] Thus, the present invention provides the complete coding sequence for Pab and Poc DNA polymerase
enzymes, from which expression vectors applicable to a variety of host systems can be constructed and the
coding sequence expressed. Portions of the present polymerase-encoding sequence are also useful as probes to
retrieve other thermostable polymerase-encoding sequences in a variety of species. Accordingly, portions of the
genomic DNA encoding at least four to six amino acids can be synthesized as oligodeoxyribonucleotide probes and
used to retrieve additional DNAs encoding a thermostable polymerase. Because there may not be an exact match
between the nucleotide sequence of the thermostable DNA polymerase gene of Pab and Poc and the corresponding
gene of other species, oligomers containing approximately 12-18 nucleotides (encoding the four to six amino
acid sequence) are usually necessary to obtain hybridization under conditions of sufficient stringency to
eliminate false positives. Sequences encoding six amino acids supply ample information for such probes.
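
The arithmetic behind the 12-18 nucleotide figure, and one simple way to see why shorter probes invite false positives, can be sketched as follows. The chance-match estimate is an illustrative model only: it assumes a random genome with equal base frequencies, exact matching on both strands, and a hypothetical 4.6 Mb target genome. None of these assumptions comes from the specification, and real hybridization stringency depends on far more than length.

```python
def probe_length_nt(n_amino_acids: int) -> int:
    """Nucleotides needed to encode a peptide of the given length (3 per codon)."""
    return 3 * n_amino_acids

def expected_random_matches(probe_nt: int, genome_bp: int) -> float:
    """Expected exact chance matches of one fixed probe in a random genome.

    Assumes equal base frequencies and counts both strands; edge effects ignored.
    """
    return 2 * genome_bp / 4 ** probe_nt

GENOME_BP = 4_600_000  # hypothetical genome size, roughly that of E. coli K-12

for n_aa in (4, 5, 6):
    n_nt = probe_length_nt(n_aa)
    hits = expected_random_matches(n_nt, GENOME_BP)
    print(f"{n_aa} amino acids -> {n_nt} nt probe, ~{hits:.2g} chance matches")
```

Under this toy model each added codon lowers the expected number of chance matches 64-fold, which is in keeping with the statement that six amino acids supply ample information.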

[0049] The present invention, by providing the coding and amino acid sequences for Pab and Poc DNA
polymerases, therefore enables the isolation of other thermostable polymerase enzymes and the coding sequences
for those enzymes. Specifically, the invention provides means for preparing primers and probes for identifying
nucleic acids encoding DNA polymerase enzymes contained within DNA isolates from related archaebacteria such as
extreme hyperthermophiles, including additional Pyrodictium species, P. brockii, and Methanopyrus species such
as M. kandleri.

[0050] Several such regions of similarity between the Pab and Poc DNA polymerase coding sequences exist. For
regions nine codons in length, probes corresponding to these regions can be used to identify and isolate
sequences encoding thermostable polymerase enzymes that are identical (and complementary) to the probe for a
contiguous sequence of at least five codons. For the region six codons in length, a probe corresponding to this
region can be used to identify and isolate thermostable polymerase-encoding DNA sequences that are identical to
the probe for a contiguous sequence of at least four codons.

[0051] One property found in the Pyrodictium DNA polymerase enzymes, but lacking in native Taq DNA polymerase
and native Tth DNA polymerase, is 3'→5' exonuclease activity. This 3'→5' exonuclease activity is generally
considered to be desirable, because misincorporated or unmatched bases of the synthesized nucleic acid sequence
are eliminated by this activity. Therefore, the fidelity of PCR utilizing a polymerase with 3'→5' exonuclease
activity (e.g., Pyrodictium DNA polymerase enzymes) is increased. However, the 3'→5' exonuclease activity found
in Pyrodictium DNA polymerase enzymes can also increase non-specific background amplification in PCR by
modifying the 3' end of the primers. The 3'→5' exonuclease activity can eliminate single-stranded DNAs, such as
primers or single-stranded template. In essence, every 3'-nucleotide of a single-stranded primer or template is
treated by the enzyme as unmatched and is therefore degraded. To avoid primer degradation in PCR, one can add
phosphorothioate to the 3' ends of the primers. Phosphorothioate-modified nucleotides are more resistant to
removal by 3'→5' exonucleases.

[0052] Whether one desires to produce an enzyme identical to native Pab or Poc DNA polymerase or a derivative
or homologue of that enzyme, the production of a recombinant form of the polymerase typically involves the
construction of an expression vector, the transformation of a host cell with the vector, and culture of the
transformed host cell under conditions such that expression will occur. To construct the expression vector, a
DNA is obtained that encodes the mature (used here to include all muteins) enzyme or a fusion polypeptide of
the polymerase, which fusion polypeptide comprises an amino acid sequence derived from the polymerases of the
present invention and an additional amino acid sequence that does not lead to the destruction of the polymerase
activity or an additional amino acid sequence cleavable under controlled conditions (such as treatment with
peptidase) to give an active protein. The coding sequence is then placed in operable linkage with suitable
control sequences in an expression vector. The vector can be designed to replicate autonomously in the host
cell or to integrate into the chromosomal DNA of the host cell. The vector is used to transform a suitable
host, and the transformed host is cultured under conditions suitable for expression of recombinant Pyrodictium
polymerase. The Pyrodictium polymerase is isolated from the medium or from the cells; recovery and purification
of the protein may not be necessary in some instances, where some impurities may be tolerated.

[0053] Construction of suitable vectors containing the desired coding and control sequences employs standard
ligation and restriction techniques that are well understood in the art (see, for example, Molecular Cloning:
A Laboratory Manual, 2nd ed., Sambrook et al., 1989, Cold Spring Harbor Press, New York, NY, which is
incorporated herein by reference). Isolated plasmids, DNA sequences, or synthesized oligonucleotides are
cleaved, modified, and religated in the form desired. Suitable restriction sites can, if not normally
available, be added to the ends of the coding sequence so as to facilitate construction of an expression vector
by methods well known in the art.

[0054] For portions of vectors or coding sequences that require sequence modifications, a variety of
site-specific primer-directed mutagenesis methods are available. For example, the polymerase chain reaction
(PCR) can be used to perform site-specific mutagenesis. PCR Protocols, ed. by Innis et al., 1990, Academic
Press, San Diego, CA, and PCR Technology, ed. by Henry Erlich, 1989, Stockton Press, New York, NY, describe
methods for cloning, modifying, and sequencing DNA using PCR and are incorporated herein by reference.

[0055] Control sequences, expression vectors, and transformation methods are dependent on the type of host cell
used to express the gene. Generally, procaryotic, yeast, insect, or mammalian cells are used as hosts.
Procaryotic hosts are in general the most efficient and convenient for the production of recombinant proteins
and are, therefore, preferred for the expression of Pyrodictium DNA polymerase enzymes.

[0056] The procaryote most frequently used to express recombinant proteins is E. coli. For cloning and
sequencing, and for expression of constructions under control of most bacterial promoters, E. coli K12 strain
MM294, obtained from the E. coli Genetic Stock Center under GCSC #6135, can be used as the host. For expression
vectors with the PLNRBS control sequence, E. coli K12 strain MC1000 lambda lysogen, N7N53cI857 SusP80, ATCC
39531, may be used. E. coli DG116, which was deposited with the ATCC (ATCC 53606) on April 7, 1987, and E. coli
KB2, which was deposited with the ATCC (ATCC 53075) on March 29, 1985, are also useful host cells. For M13
phage recombinants, E. coli strains susceptible to phage infection, such as E. coli K12 strain DG98, are
employed. The DG98 strain was deposited with the ATCC (ATCC 39768) on July 13, 1984.

[0057] However, microbial strains other than E. coli can also be used, such as bacilli, for example Bacillus
subtilis, various species of Pseudomonas, and other bacterial strains, for recombinant expression of
Pyrodictium DNA polymerase enzymes.

[0058] In addition to bacteria, eucaryotic microbes, such as yeast, can also be used as recombinant host cells.
See, for example, Stinchcomb et al., 1979, Nature 282:39; Tschempe et al., 1980, Gene 10:157; and Clarke et
al., 1983, Meth. Enz. 101:300.

[0059] The Pyrodictium gene can also be expressed in eucaryotic host cell cultures derived from multicellular
organisms. See, for example, Tissue Culture, Academic Press, Cruz and Patterson, editors (1973). Useful host
cell lines include COS-7, COS-A2, CV-1, murine cells such as murine myelomas N51 and VERO, HeLa cells, and
Chinese hamster ovary (CHO) cells. Plant cells can also be used as hosts, and control sequences compatible with
plant cells, such as the nopaline synthase promoter and polyadenylation signal sequences (Depicker et al.,
1982, J. Mol. Appl. Gen. 1:561), are available.

[0060] Depending on the host cell used, transformation is done using standard techniques appropriate to such
cells. The calcium treatment employing calcium chloride, as described by Cohen, 1972, Proc. Natl. Acad. Sci.
USA 69:2110, is used for procaryotes or other cells that contain substantial cell wall barriers. For mammalian
cells, the calcium phosphate precipitation method of Graham and van der Eb, 1978, Virology 52:546, is
preferred. Transformations into yeast are carried out according to the method of Van Solingen et al., 1977,
J. Bact. 130:946, and Hsiao et al., 1979, Proc. Natl. Acad. Sci. USA 76:3829.

[0061] Once the Pyrodictium DNA polymerase has been expressed in a recombinant host cell, purification of the
protein may be desired. Although the purification procedures previously described can be used to purify the
recombinant thermostable polymerase of the invention, hydrophobic interaction chromatography purification
methods are preferred. Hydrophobic interaction chromatography is a separation technique in which substances are
separated on the basis of differing strengths of hydrophobic interaction with an uncharged bed material
containing hydrophobic groups. Typically, the column is first equilibrated under conditions favorable to
hydrophobic binding, e.g., high ionic strength. A descending salt gradient may be used to elute the sample.

[0062] Detailed protocols for purifying recombinant thermostable DNA polymerases have been described in, for
example, PCT Patent Publication Nos. WO 92/03556, published March 5, 1992, and WO 91/09950, published July 11,
1991. These publications are incorporated herein by reference. The methods described therein for Thermotoga
maritima are suitable. Example 9 (see below) provides a preferred protocol for purifying recombinant
Pyrodictium polymerase enzymes.

[0063] For long-term stability, the Pyrodictium DNA polymerase enzyme is preferably stored in a buffer that
contains one or more non-ionic polymeric detergents. Such detergents are generally those that have a molecular
weight in the range of approximately 100 to 250,000, preferably about 4,000 to 200,000 daltons, and stabilize
the enzyme at a pH of from about 3.5 to about 9.5, preferably from about 4 to 8.5. Examples of such detergents
include those specified on pages 295-298 of McCutcheon's Emulsifiers & Detergents, North America edition (1983),
published by the McCutcheon Division of MC Publishing Co., 175 Rock Road, Glen Rock, NJ (USA), the entire
disclosure of which is incorporated herein by reference. Preferably, the detergents are selected from the group
comprising ethoxylated fatty alcohol ethers and lauryl ethers, ethoxylated alkyl phenols, octylphenoxy
polyethoxy ethanol compounds, modified oxyethylated and/or oxypropylated straight-chain alcohols, polyethylene
glycol monooleate compounds, polysorbate compounds, and phenolic fatty alcohol ethers. More particularly
preferred are Tween 20, a polyoxyethylated (20) sorbitan monolaurate from ICI Americas Inc., Wilmington, DE,
and Iconol™ NP-40, an ethoxylated alkyl phenol (nonyl) from BASF Wyandotte Corp., Parsippany, NJ.

[0064] The thermostable enzyme of this invention may be used for any purpose in which such enzyme activity is
necessary or desired. In a particularly preferred embodiment, the enzyme catalyzes the nucleic acid
amplification reaction known as PCR.

[0065] Although the PCR process is well known in the art (see U.S. Patent Nos. 4,683,195; 4,683,202; and
4,965,188, each of which is incorporated herein by reference) and although commercial vendors, such as Perkin
Elmer, sell PCR reagents and publish PCR protocols, some general PCR information is provided below for purposes
of clarity and full understanding of the invention to those unfamiliar with the PCR process.

[0066] To amplify a target nucleic acid sequence in a sample by PCR, the sequence must be accessible to the
components of the amplification system. In general, this accessibility is ensured by isolating the nucleic
acids from the sample. A variety of techniques for extracting nucleic acids from biological samples are known
in the art. For example, see those described in Higuchi et al., 1989, in PCR Technology (Erlich ed., Stockton
Press, New York).

[0067] Because the nucleic acid in the sample is first denatured (assuming the sample nucleic acid is
double-stranded) to begin the PCR process, and because simply heating some samples results in the disruption of
cells, isolation of nucleic acid from the sample can sometimes be accomplished in conjunction with strand
separation. Strand separation can be accomplished by any suitable denaturing method, however, including
physical, chemical, or enzymatic means. Typical heat denaturation involves temperatures ranging from about
80-105°C for times ranging from seconds to about 1 to 10 minutes.

[0068] As noted above, strand separation may be accomplished in conjunction with the isolation of the sample
nucleic acid or as a separate step. In the preferred embodiment of the PCR process, strand separation is
achieved by heating the reaction to a sufficiently high temperature for an effective time to cause the
denaturation of the duplex, but not to cause an irreversible denaturation of the polymerase (see U.S. Patent
No. 4,965,188). No matter how strand separation is achieved, however, once the strands are separated, the next
step in PCR involves hybridizing the separated strands with primers that flank the target sequence. The primers
are then extended to form complementary copies of the target strands, and the cycle of denaturation,
hybridization, and extension is repeated as many times as necessary to obtain the desired amount of amplified
nucleic acid.
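
Because each extension product can serve as a template in the next cycle, the number of repetitions needed for a given yield can be estimated with a simple geometric model. The sketch below is illustrative only: the per-cycle efficiency parameter and the function names are assumptions introduced here, and real reactions plateau in late cycles.

```python
import math

def copies_after(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Template copies after `cycles` rounds, each multiplying by (1 + efficiency).

    efficiency = 1.0 models ideal doubling; lower values model incomplete extension.
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles

def cycles_for_fold(fold: float, efficiency: float = 1.0) -> int:
    """Smallest whole number of cycles giving at least `fold` amplification."""
    if fold <= 1.0:
        return 0
    return math.ceil(math.log2(fold) / math.log2(1.0 + efficiency))

print(copies_after(1, 30))            # ideal doubling for 30 cycles
print(cycles_for_fold(1e6))           # cycles for a million-fold increase, ideal case
print(cycles_for_fold(1e6, 0.8))      # same target at a hypothetical 80% efficiency
```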

[0069] For successful PCR amplification, the primers are designed so that the position at which each primer
hybridizes along a duplex sequence is such that an extension product synthesized from one primer, when
separated from the template (complement), serves as a template for the extension of the other primer to yield
an amplified segment of nucleic acid of defined length.

[0070] Template-dependent extension of primers in PCR is catalyzed by a polymerizing agent in the presence of
adequate amounts of four deoxyribonucleoside triphosphates (dATP, dGTP, dCTP, and dTTP) in a reaction medium
comprised of the appropriate salts, metal cations, and pH buffering system.

[0071] The amplification method is useful not only for producing large amounts of a specific nucleic acid
sequence of known sequence but also for producing nucleic acid sequences which are known to exist but are not
completely specified. One need know only a sufficient number of bases at both ends of the sequence in
sufficient detail so that two oligonucleotide primers can be prepared which will hybridize to different strands
of the desired sequence at relative positions along the sequence such that an extension product synthesized
from one primer, when separated from the template (complement), can serve as a template for extension of the
other primer into a nucleic acid sequence of defined length. The greater the knowledge about the bases at both
ends of the sequence, the greater can be the specificity of the primers for the target nucleic acid sequence
and the efficiency of the process.

[0072] Any nucleic acid sequence, in purified or nonpurified form, can be utilized as the starting nucleic
acid(s), provided it contains or is suspected to contain the specific nucleic acid sequence desired. Thus, the
process may employ, for example, DNA or RNA, including messenger RNA, which DNA or RNA may be single-stranded
or double-stranded. For example, if the template is RNA, a suitable polymerizing agent to convert the RNA into
a complementary, copy-DNA (cDNA) sequence is reverse transcriptase (RT), such as avian myeloblastosis virus RT
and Thermus thermophilus DNA polymerase, a thermostable DNA polymerase with reverse transcriptase activity
developed and manufactured by Hoffmann-La Roche Inc. and marketed by Perkin Elmer (see PCT Patent Publication
WO 91/09950).

[0073] Whether the nucleic acid is single- or double-stranded, the DNA polymerase from Pyrodictium may be
added at the denaturation step or when the temperature is being reduced to or is in the range for promoting
hybridization. Although the thermostability of Pyrodictium polymerase allows one to add the polymerase to the
reaction mixture at any time, one can substantially inhibit non-specific amplification by adding the polymerase
to the reaction mixture at a point in time when the mixture will not be cooled below the stringent
hybridization temperature. After hybridization, the reaction mixture is then heated to or maintained at a
temperature at which the activity of the enzyme is promoted or optimized, i.e., a temperature sufficient to
increase the activity of the enzyme in facilitating synthesis of the primer extension products from the
hybridized primer and template. The temperature must actually be sufficient to synthesize an extension product
of each primer which is complementary to each nucleic acid template, but must not be so high as to denature
each extension product from its complementary template (i.e., the temperature is generally less than about
80-90°C).

[0074] Depending on the nucleic acid(s) employed, the typical temperature effective for this synthesis reaction
generally ranges from about 40-80°C, preferably 50-75°C. The temperature more preferably ranges from about
65-75°C for P. occultum and P. abyssi DNA polymerase enzymes. The period of time required for this synthesis
may range from about 0.5 to 40 minutes or more, depending mainly on the temperature, the length of the nucleic
acid, and the enzyme. The extension time is usually about 30 seconds to three minutes. If the nucleic acid is
longer, a longer time period is generally required for complementary strand synthesis.

[0075] Those skilled in the art will know that the PCR process is most usually carried out as an automated
process with a thermostable enzyme. In this process, the temperature of the reaction mixture is cycled through
a denaturing region, a primer annealing region, and a reaction region. A machine specifically adapted for use
with a thermostable enzyme is commercially available from Perkin Elmer.
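
A three-region cycling program of the kind described can be represented as a short list of steps. The sketch below is a hypothetical illustration: the specific set points (95°C, 55°C, 72°C) and hold times are assumptions chosen for the example, and only the denaturation range (about 80-105°C) and the preferred extension range (about 65-75°C) are taken from the paragraphs above. Ramp times between steps are ignored.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    seconds: int

# Illustrative set points only; not a protocol from the specification.
PROFILE = (
    Step("denature", 95.0, 30),
    Step("anneal", 55.0, 30),
    Step("extend", 72.0, 60),
)

# Temperature ranges stated in the text for denaturation and for extension
# with P. occultum and P. abyssi polymerases.
SOURCE_RANGES_C = {"denature": (80.0, 105.0), "extend": (65.0, 75.0)}

def out_of_range(profile) -> list[str]:
    """Names of steps whose set point falls outside the stated range, if any."""
    bad = []
    for step in profile:
        low, high = SOURCE_RANGES_C.get(step.name, (float("-inf"), float("inf")))
        if not low <= step.temp_c <= high:
            bad.append(step.name)
    return bad

def hold_minutes(profile, cycles: int) -> float:
    """Total time spent at set points over all cycles, excluding ramping."""
    return cycles * sum(step.seconds for step in profile) / 60

print(out_of_range(PROFILE), hold_minutes(PROFILE, 30))
```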

[0076] Those skilled in the art will also be aware of the problem of contamination of a PCR by the amplified
nucleic acid from previous reactions. Methods to reduce this problem are provided in U.S. patent application
Serial No. 609,157, incorporated herein by reference.

[0077] PCR amplification may yield primer dimers or oligomers, double-stranded side products containing the
sequences of several primer molecules joined end-to-end, the yield of which correlates negatively with the
yield of amplified target sequence. Non-specific priming and primer dimer and oligomer formation can occur
whenever all of the PCR reagents are mixed, even at ambient and sub-ambient temperatures in the absence of
thermal cycling and in the presence or absence of target DNA. At 37°C, for example, Taq retains approximately
1-2% activity, although the optimal temperature is about 75-80°C. Methods for overcoming non-specific extension
and primer dimer formation include segregation of at least one reagent from the others in a way such that all
reagents do not come together before the first amplification cycle. PCT Patent Publication No. WO 91/12342,
which is incorporated herein by reference, describes methods and compositions for minimizing non-specific
extension and primer dimer formation.

[0078] Because of the extremely high optimum growth temperature of Pyrodictium species, the present invention
provides compositions that may be useful for minimizing non-specific primer extension. Specifically, the
optimal growth temperature for Pyrodictium occultum and P. abyssi is 100-105°C, approximately 30-35°C higher
than, for example, Thermus aquaticus. Consequently, the residual activity of Pyrodictium DNA polymerases at
room temperature is expected to be minimal and may eliminate the need to segregate at least one reagent prior
to the first cycle of PCR. Thus, the present invention offers the potential of reduced non-specific extension
at non-stringent annealing temperatures in a PCR without the use of wax barriers or other means of reagent
segregation.

[0079] Those of skill in the art will recognize that the present invention provides novel compositions for the
practice of any methods for which a DNA polymerase has utility. In a preferred embodiment, the enzymes are
useful for amplifying nucleic acid sequences by PCR. Other amplification methods, particularly those requiring
a heat denaturation step such as PLCR (Barany, 1991, PCR Methods and Applications 1(1):5-13) or gap-LCR (see,
for example, PCT Patent Publication No. 90/01069, published February 8, 1990) will also benefit from the
present invention. Cycle sequencing methods (Caruthers et al., 1989, BioTechniques 7:494-499, and Koop et al.,
1992, BioTechniques 14:442-447, incorporated herein by reference) will particularly benefit from 3'→5'
exonuclease deficient Pab and Poc DNA polymerase enzymes.

[0080] The present invention also provides kits comprising a thermostable DNA polymerase of the present
invention, preferably a stable enzyme composition comprising said polymerase in a buffer containing one or more
non-ionic polymeric detergents, and optionally further reagents useful for performing a PCR reaction, such as a
set of primers, probes, or nucleoside triphosphate precursors.

[0081] Pyrodictium DNA polymerase is very useful in carrying out the diverse processes in which amplification
of a nucleic acid sequence by the polymerase chain reaction is useful. Such methods include cloning, DNA
sequencing, reverse transcription, and asymmetric PCR. Further, the enzymes of the invention are suitable for
use in diagnostic, forensic, and research applications. The following examples are offered by way of
illustration only and are by no means intended to limit the scope of the claimed invention.

Example 1 

Construction of a Genomic Pyrodictium abyssi DNA Library and Identification of the Pab Polymerase Gene by a
Colony Blot Thermostable DNA Polymerase Activity Assay

[0082] Pyrodictium abyssi cells were received from Dr. Karl O. Stetter, University of Regensburg, Regensburg,
Germany. The isolate, AVZ (DSM6158), is described in Pley et al., 1991, System Applied Microbiology 14:245-253,
which is incorporated herein by reference. DNA was purified by the method described in Lawyer et al., 1989,
J. Biological Chemistry 264(11):6427-6437, which is incorporated herein by reference. About 25 µg of
Pyrodictium abyssi DNA was partially digested with the restriction enzyme Sau3AI and size-fractionated by gel
electrophoresis. Ten ng of fragments which were larger than 3.5 kb and smaller than 8.5 kb were used for
cloning into the BamHI site of pUC19 vector (Clontech, Palo Alto, CA). The pUC19 plasmid vector has the lac
promoter upstream from the BamHI cloning site. The promoter can induce heterologous expression of cloned open
reading frames lacking promoter sequences. The recombinant plasmids were transformed into E. coli SURE® cells
(Stratagene). Genotype of SURE® cells: mcrA, Δ(mcrBC-hsdRMS-mrr)171, endA1, supE44, thi-1, λ-, gyrA96, relA1,
lac, recB, recJ, sbcC, umuC::Tn5(kan^r), uvrC, (F', proAB, lacI^qZΔM15, Tn10[tet^r]).

[0083] A rapid filter assay for the detection of thermoresistant and thermophilic DNA polymerase activity was used
to screen the Pyrodictium abyssi genomic DNA library (Sagner et al., 1991, Gene 97:119-123, incorporated herein by
reference). According to the method, recombinant colonies are bound to nitrocellulose membrane and are incubated at
elevated temperature in a polymerization buffer containing [α-32P]-labeled dNTPs. By autoradiography of the dried
filters, colonies which express thermophilic DNA polymerase activity can be directly identified. The membrane-bound
colonies are heated to 95°C to irreversibly inactivate host DNA polymerases and are subsequently incubated at elevated
temperatures to reveal the presence of thermophilic DNA polymerase activity.

[0084] Approximately 500 colonies were plated per petri dish and grown overnight at 37°C. Subsequently, the colonies
were replica-plated onto nitrocellulose membranes and grown for 4 hours. The membranes were placed upside
down on agarose plates which were placed for 20 minutes at room temperature on filter papers soaked with a mixture
of chloroform/toluene (1:1). The membranes containing the permeabilized colonies were then incubated at 95°C for 5
minutes in a polymerization buffer containing 50 mM Tris-HCl pH 8.8, 7 mM MgCl2, 3 mM β-mercaptoethanol (βMe) to
inactivate any nonthermoresistant (e.g., E. coli) DNA polymerase activity. Immediately after inactivation the membranes
were transferred to the polymerization buffer containing 50 mM Tris-HCl pH 8.8, 7 mM MgCl2, 3 mM βMe, 12 µM dCTP,
12 µM dGTP, 12 µM dATP, 12 µM dTTP, and 1 µCi per ml [α-32P]-dGTP. After incubation for 30 minutes at 65°C the
membranes were washed twice for 5 minutes in a solution of 5% (w/v) TCA and 1% (w/v) pyrophosphate to remove
unincorporated [α-32P]-dGTP. The membranes were analyzed by autoradiography at -70°C. Seven clones were apparent
on X-ray film of duplicated membranes after 3 days.

[0085] Plasmid DNAs were isolated from these 7 clones, and restriction analysis was performed to determine the size
and orientation of insert fragments relative to the pUC19 vector. DNA sequence analysis was performed on the largest
clone, pPab14. The "universal" forward and reverse sequencing primers, Nos. 1212 and 1233, respectively, purchased
from New England BioLabs, Beverly, MA, were used to obtain preliminary DNA sequences. From the preliminary DNA
sequence, further sequencing primers were designed to obtain DNA sequence of more internal regions of the cloned
insert. DNA sequence analysis has been performed for both strands.

Example 2

Expression of the Pab Polymerase Gene

[0086] Plasmid pDG168 is a λPL cloning and expression vector that comprises the λPL promoter and gene N
ribosome-binding site (see U.S. Patent No. 4,711,845, which is incorporated herein by reference), a restriction site
polylinker positioned so that the sequences cloned into the polylinker can be expressed under control of the λPL-NRBS,
and a transcription terminator from the Bacillus thuringiensis delta-toxin gene (see U.S. Patent No. 4,666,846, which is
incorporated herein by reference). Plasmid pDG168 also carries a mutated RNA II gene which renders the plasmid
temperature sensitive for copy number (see U.S. Patent No. 4,631,257, which is incorporated herein by reference) and an
ampicillin resistance gene in E. coli K12 strain DG116. The construction of pDG168 is described in PCT Patent
Publication No. WO 91/09950, published July 11, 1991, at Example 6, which is incorporated herein by reference.
[0087] These elements act in concert to provide a useful and powerful expression vector. At 30-32°C, the copy
number of the plasmid is low, and in a host cell that carries a temperature sensitive λ repressor gene, such as cI857, the
PL promoter does not function. At 37-41°C, however, the copy number of the plasmid is 25-50 fold higher than at
30-32°C, and the cI857 repressor is inactivated, allowing the promoter to function. Thus, pDG168 was selected for
constructing expression vectors for Pab DNA polymerase.

[0088] The DNA sequence analysis of pPab14 revealed an open reading frame of 803 amino acids having an ATG
start codon at nucleotide position 869 and a TGA stop codon at nucleotide position 3280. The 5' end of the Pab gene
was mutagenized with oligonucleotide primers AW397 (SEQ ID No. 5) and AW398 (SEQ ID No. 6) by PCR amplification
(as described below). AW397 (SEQ ID No. 5) is a forward primer which was designed to alter the Pab DNA sequence at
the ATG start to introduce an NdeI restriction site. Primer AW397 (SEQ ID No. 5) also introduced mutations in the fifth
and sixth codons of the Pab polymerase gene sequence to be more compatible with the codon usage of E. coli, without
changing the amino acid sequence of the encoded protein. The reverse primer, AW398 (SEQ ID No. 6), was chosen to
include a SpeI site corresponding to amino acid position 174. In addition, a KpnI site was introduced after the SpeI site.
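
The open reading frame coordinates given above can be cross-checked with simple arithmetic. The following Python sketch assumes 1-based nucleotide numbering and assumes that position 3280 is the last nucleotide of the TGA stop codon (the text does not state which nucleotide of the codon is meant); the function name is illustrative only.

```python
def orf_amino_acids(start_first_nt: int, stop_last_nt: int) -> int:
    """Count the amino acids encoded between a start and a stop codon.

    Assumes 1-based positions, with start_first_nt the A of the ATG and
    stop_last_nt the final nucleotide of the stop codon (an assumption).
    """
    span = stop_last_nt - start_first_nt + 1  # nucleotides, stop codon included
    if span % 3 != 0:
        raise ValueError("coordinates do not delimit a whole number of codons")
    return span // 3 - 1  # the stop codon encodes no amino acid


# 869..3280 spans 2412 nt, i.e. 804 codons including the stop codon.
print(orf_amino_acids(869, 3280))
```

Under this reading the coordinates agree with the stated length of 803 amino acids; if 3280 were instead the first nucleotide of the stop codon, the span would not be a multiple of three.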
[0089] The PCR reaction mixture contained 10 ng of SalI-linearized pPab14 DNA as the template; 10 pmol of primers
AW397 (SEQ ID No. 5) and AW398 (SEQ ID No. 6); 50 µM of each dATP, dCTP, dTTP, and dGTP; 2 mM MgCl2; 10
mM Tris-HCl, pH 8.3; 50 mM KCl; and 1 unit Taq polymerase in a 50 µl reaction volume. The reaction thermo-profile was
95°C for 30 seconds, 65°C for 30 seconds, and 72°C for 30 seconds, amplified for 12 cycles. The 500 bp amplified
product was digested with NdeI and KpnI and loaded on a 1% (w/v) Seakem agarose gel. The PCR product fragment
was purified with a Geneclean kit (Bio 101, San Diego, CA) and subcloned into expression vector pDG168, which had
been digested with NdeI and KpnI. The resulting clone was named pAW111. The desired mutations were confirmed via
restriction enzyme analysis and DNA sequence analysis.
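
The amounts in the reaction mixture can be converted between molar quantity and concentration as follows. This is a minimal sketch, assuming that the 10 pmol figure applies to each primer and that the dNTP figure is 50 µM for each nucleotide; the helper names are hypothetical.

```python
def to_micromolar(picomoles: float, microliters: float) -> float:
    """Convert an amount in pmol and a volume in µl to µM (pmol/µl equals µmol/l)."""
    return picomoles / microliters


def to_picomoles(micromolar: float, microliters: float) -> float:
    """Convert a concentration in µM and a volume in µl to an amount in pmol."""
    return micromolar * microliters


REACTION_VOLUME_UL = 50.0

# Assumption: 10 pmol of each primer in the 50 µl reaction.
primer_concentration_um = to_micromolar(10.0, REACTION_VOLUME_UL)

# Assumption: each dNTP present at 50 µM.
dntp_amount_pmol = to_picomoles(50.0, REACTION_VOLUME_UL)

print(primer_concentration_um, dntp_amount_pmol)
```

Under these assumptions each primer is present at 0.2 µM and each dNTP at 2500 pmol (2.5 nmol) per reaction.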

[0090] The 3' end of the Pab polymerase gene was modified via restriction enzyme digestion and use of a synthetic
oligonucleotide duplex. AW399 (SEQ ID No. 7) was designed according to the 3' end of the Pab polymerase (pol) gene
from the AflII site at amino acid position 785-786. It changes the TGA stop codon to TAA as well. AW400 (SEQ ID No.
8) is the complementary strand of AW399 (SEQ ID No. 7) except that it has an XmaI cohesive end at its 5' end. When
AW399 (SEQ ID No. 7) anneals to AW400 (SEQ ID No. 8), it produces a 60 bp synthetic duplex with 5' cohesive
AflII/XmaI ends. The duplex was then cloned into plasmid pPab2 that had been digested with AflII and XmaI. The
resulting plasmid was designated pAW113. Plasmid pPab2 was one of the 7 clones isolated from the genomic library as
described in Example 1. Plasmid pPab2 contains the entire Pab pol gene but is ~250 bp shorter than pPab14 at the 5'
end. Thus, it lacks a flanking 5'-end AflII site, which facilitated the cloning strategy of replacing the 3' end AflII-XmaI
fragment with the synthetic duplex AW399 (SEQ ID No. 7)/AW400 (SEQ ID No. 8) as described above. The DNA
sequence of the replaced fragment was confirmed by DNA sequence analysis.

[0091] Finally, the 1.89 kb fragment of the Pab polymerase gene region, SpeI through the stop codon, was isolated
from pAW113 by digestion with SpeI and XmaI, and purified via gel electrophoresis. The resulting fragment was ligated
with plasmid pAW111 that had been digested with SpeI and XmaI.

[0092] The ligation conditions were 20 µg/ml DNA, 20 mM Tris-HCl, pH 7.4, 50 mM NaCl, 10 mM MgCl2, 40 µM ATP,
and 0.2 Weiss unit T4 DNA ligase per 20 µl reaction at 16°C overnight. Ligations were transformed into DG116 host
cells. Candidates were screened for appropriate restriction enzyme sites. The desired plasmid was designated
pAW115.

[0093] The oligonucleotides used in this example are shown below. 

AW397   SEQ ID No. 5   5'-GGACCCATATGOCAGAAGCTATTCAATTCGrcCTCC
AW398   SEQ ID No. 6   5*<K3CA(XiTA(XACTACnTATGTOGGGAATAGGCTC
AW399   SEQ ID No. 7   5'-TTAAGGCAGCATCATCTGGGCATAGGAGTCTCTTCGACTTCTTCGCGGCAAAGAAGTAAC
AW400   SEQ ID No. 8   5'-CCGGGTTACTTCTTTGCCGCGAAGAAGTCGAAGAGACTCCTATGCCCAGATGATGCTGCC



EP 0 624 641 B1 



Example 3

Cloning the Pyrodictium occultum (Poc) DNA Polymerase Gene

[0094] Pab and Poc genomic DNA (0.5 µg each) were digested with HindIII, and were separated by gel electrophoresis
through a 0.8% (w/v) agarose gel. Pyrodictium occultum cells were received from Dr. Karl O. Stetter, University
Regensburg, Regensburg, Germany. DNA was purified by the method described in Lawyer et al., 1989, J. Biological
Chemistry 264(11):6427-6437, which is incorporated herein by reference. The DNA fragments in the gel were denatured
in 1.5 M NaCl and 0.5 M NaOH solution for 30 minutes and were neutralized in a solution of 1 M Tris-HCl, pH 8.0
and 1.5 M NaCl for 30 minutes, and then were transferred to a Biodyne nylon membrane (Pall Biosupport, East Hills,
NY) using 20 x SSPE (3.6 M NaCl, 200 mM NaPO4/pH 7.4, 20 mM EDTA/pH 7.4). The DNA attached to the membrane
was then hybridized to a 32P-labeled 240 bp PCR product which encoded amino acids 515-614 of the Pab polymerase
gene. The prehybridization solution was 6 x SSPE, 5X Denhardt's reagent, 0.5% (w/v) SDS, 100 µg/ml denatured,
sheared, salmon sperm DNA. Hybridization solution was the same except that Denhardt's reagent was used at 2X, and
contained 10^ cpm 32P-labeled PCR-amplified probe. Prehybridization and hybridization were both at 55°C. The blot
was washed sequentially as follows: 2 x SSPE, 0.5% (w/v) SDS, 10 minutes at room temperature; 2 x SSPE, 0.1% (w/v)
SDS, 15 minutes at room temperature; 0.1% (w/v) SSPE, 0.1% (w/v) SDS, 5 minutes at room temperature.
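
The working-strength SSPE solutions named above follow from the 20 x stock composition by simple dilution. A minimal sketch, assuming straightforward volumetric dilution of the stock; the dictionary keys and function name are illustrative.

```python
# Composition of the 20 x SSPE stock as stated in the text, in mM.
SSPE_20X_MM = {"NaCl": 3600.0, "NaPO4": 200.0, "EDTA": 20.0}


def sspe_working_strength(fold: float) -> dict:
    """Component concentrations (mM) of an SSPE solution at the given strength."""
    return {name: conc * fold / 20.0 for name, conc in SSPE_20X_MM.items()}


for strength in (6, 2):
    print(strength, sspe_working_strength(strength))
```

On this basis 6 x SSPE corresponds to 1.08 M NaCl, 60 mM sodium phosphate, and 6 mM EDTA, and 2 x SSPE to 0.36 M NaCl, 20 mM sodium phosphate, and 2 mM EDTA.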

[0095] A strong signal was apparent at approximately 3.8 kb in the HindIII digest. This suggested that the Poc
polymerase gene has homology with the Pab polymerase gene. Consequently, several PCR primers, designed from the
Pab polymerase gene sequence, were evaluated for amplification of portions of the Poc polymerase gene. A specific
PCR product, 295 bp in size, resulted from a PCR using primer pair LS417 (SEQ ID No. 34) and LS396 (SEQ ID No. 35).

LS417   SEQ ID No. 34   5'-GATAAAGATAGACAAGGTATAC
LS396   SEQ ID No. 35   5'-(XTATTCCrCGATTCTCrTT
AW394   SEQ ID No. 9    5'-GCTTATAGCCTTGTCCACGTTC

The PCR was performed at a final concentration of 1 X PCR buffer, 50 µM dNTPs, 0.1 µM each primer, and 1.25 units Taq
in a total volume of 50 µl. 1 X PCR buffer contains 20 mM Tris pH 8.4, 50 mM KCl, 2 mM MgCl2. The reaction was
amplified for 35 cycles.

[0096] The 295 bp PCR product was then subjected to DNA sequence analysis. The DNA sequence result showed
that the Poc polymerase gene has 78% identity with the Pab polymerase gene in this region. A Poc polymerase specific
oligonucleotide probe AW394 (SEQ ID No. 9) was designed using this DNA sequence data. The 32P-labeled AW394
(SEQ ID No. 9) was then used to screen a genomic Poc DNA bank to obtain Poc polymerase clones. The construction
of the genomic Poc DNA bank was as described in Example 1 for the genomic Pab DNA bank.

[0097] About 5,500 ampicillin-resistant colonies were selected on nitrocellulose filters and hybridized with
32P-labeled AW394 (SEQ ID No. 9). Plasmid DNA was isolated from 6 colonies that hybridized with the probe.
Prehybridization and hybridization conditions were as described above. Wash conditions were 6 x SSPE, 0.1% (w/v) SDS
for 5 minutes at room temperature, followed by 2 x SSPE, 0.1% (w/v) SDS for 15 minutes at 55°C. Restriction enzyme
analysis and PCR analysis were performed to determine the size and orientation of insert fragment relative to the
pUC19 vector. The results revealed that pPoc3 and pPoc5 are identical clones. The sizes of the coding region, 5' end
non-translated region and 3' end non-translated region of all identified Poc polymerase clones are listed below.



Clone    Coding Region    5'-end    3'-end
pPoc1    1.9 kb           0         3.6 kb
pPoc2    1.9 kb           0         4.2 kb
pPoc4    2.4 kb           0.4 kb    0.7 kb
pPoc5    0.35 kb          0         4.5 kb
pPoc6    0.35 kb          0         3.2 kb
pPoc8    0.7 kb           3 kb      0


DNA sequence analysis was performed on pPoc4. Universal and reverse sequencing primers were used to obtain
preliminary DNA sequence information. From this DNA sequence additional sequencing primers were designed to obtain
the DNA sequence of more internal regions of the insert. DNA sequence analysis has been performed for both strands.

Example 4

Expression of the Poc Polymerase Gene 

[0098] The 5' end of the Poc polymerase gene in plasmid pPoc4 was mutagenized with oligonucleotide primers
AW408 (SEQ ID No. 10) and AW409A (SEQ ID No. 11) via PCR amplification. AW408 (SEQ ID No. 10) is a forward
primer designed to alter the DNA sequence of the Poc gene at the ATG start codon to introduce an NsiI restriction site.
AW408 (SEQ ID No. 10) also was designed to introduce alterations in the second, third, fifth, and sixth codons of the
Poc gene to provide a sequence more compatible with the codon usage of E. coli without changing the amino acid
sequence of the encoded protein. The reverse primer AW409A (SEQ ID No. 11) was chosen to include an XbaI site at
amino acid position 38. In addition, a KpnI site was introduced after the XbaI site for subsequent subcloning.
[0099] Plasmid pPoc4, linearized with KpnI, was used as the PCR template for amplification using the AW408 (SEQ
ID No. 10)/AW409A (SEQ ID No. 11) primer pair, yielding a 138 bp PCR product. The PCR amplification procedure was
as described above at Example 2. The amplified fragment was digested with NsiI, then treated with Klenow to create a
blunt end at the NsiI-cleaved end, and finally digested with KpnI. The resulting fragment was ligated with expression
vector pDG164 (which is described in detail in PCT Patent Publication No. WO 91/09950, at Example 6b, and
incorporated herein by reference) that had been digested with NdeI, repaired with Klenow to fill in the overhang and
provide a blunt end for ligation, and then digested with KpnI. The ligation yielded an in-frame coding sequence of the 5'
end of the Poc polymerase gene under control of the λPL promoter and bacteriophage T7 gene 10 ribosome binding site.
The resulting construct was designated pAW118.

[0100] To effect subcloning of the 3' end of the Poc polymerase gene, a KpnI site was introduced after the stop
codon. This was done by a PCR process as follows. The forward primer was chosen to include an EspI site at amino acid
position 698-699, and the reverse primer was designed to incorporate a KpnI site immediately following an altered stop
codon (TAA). The amplified 335 bp fragment was digested with EspI and KpnI, and cloned into plasmid pPoc4 digested
with EspI and KpnI. The resulting construct was designated pAW120.

[0101] Finally, the Poc pol gene region XbaI through the stop codon was isolated from pAW120 by digestion with
XbaI and KpnI. The resulting 2.3 kb fragment was ligated with pAW118 that had been digested with XbaI and KpnI. The
ligation product was transformed into DG116 host cells for expression and designated pAW121.
[0102] The oligonucleotides used in this example are given below.

AW408    SEQ ID No. 10   GGACCATGCATGACTCAAACTATTGAATTC^
AW409A   SEQ ID No. 11   GGAAGGTACCTGATCATCTAGAAGCACGACACGTT
AW410    SEQ ID No. 12   GGAAGCTGAGCAAGAGGATAGAGG
AW411A   SEQ ID No. 13   GGAAGGTACCTTATTIXnTrGAGGCGAAGAAG



Example 5

Expression of Pab pol Gene and Poc pol Gene in Tryptophan Promoter Vector

[0103] Both the Pab pol gene and the Poc pol gene can be over-expressed under the control of the E. coli Trp
promoter. Construction of the expression clones was performed as follows: The λPL promoter in expression clone
pAW115 was replaced by a Trp promoter sequence which was generated by PCR amplification using plasmid
pLSG10 (plasmid pLSG10 is described in U.S. Patent No. 5,079,352, which is incorporated herein by reference) as
template and AW500 (SEQ ID No. 14) and AW501 (SEQ ID No. 15) as primers. The resulting PCR product was
digested with NspV and NdeI and cloned into NspV and NdeI digested pAW115 to give rise to a Pab pol expression
clone, pAW118, under control of the E. coli Trp promoter.

[0104] An internal NdeI site in the Poc pol gene of pAW121 complicates the exchange of the NspV-NdeI λPL promoter
fragment and the Trp promoter fragment. Therefore, primers AW500 (SEQ ID No. 14) and AW502 (SEQ ID No. 16) were
designed to amplify the Trp promoter sequence fragment from pLSG10, and primers AW503 (SEQ ID No. 17) and
AW504 (SEQ ID No. 18) were designed to amplify the 5' end 110 bp NdeI-XbaI fragment from pAW121. AW502 (SEQ
ID No. 16) and AW503 (SEQ ID No. 17) overlap by 9 nucleotides. Using overlap extension PCR, the Trp promoter
fragment and the 5' end 110 bp fragment were fused. The resulting PCR product was digested with NspV and XbaI and
cloned into pAW121 which had been digested with NspV and XbaI. The resulting Poc pol expression clone was
named pAW123.
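
The 9 nucleotide overlap between AW502 and AW503 can be illustrated by comparing the 5' ends of the two primers. The sketch below uses only the first nine nucleotides of each primer as printed in the listing that follows; the function names are hypothetical, and the check assumes the overlap lies at the 5' ends of both primers.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def five_prime_overlap(primer_a: str, primer_b: str) -> int:
    """Longest n such that the first n nt of primer_a are the reverse
    complement of the first n nt of primer_b."""
    longest = 0
    for n in range(1, min(len(primer_a), len(primer_b)) + 1):
        if reverse_complement(primer_a[:n]) == primer_b[:n]:
            longest = n
    return longest


AW502_FIRST_9 = "CATAAGCTT"  # 5' end of AW502 as printed
AW503_FIRST_9 = "AAGCTTATG"  # 5' end of AW503 as printed

print(five_prime_overlap(AW502_FIRST_9, AW503_FIRST_9))
```

The two nine-nucleotide stretches are exact reverse complements of each other, consistent with the stated overlap.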



AW500   SEQ ID No. 14   TmrCGAAAGAAGAAAAAACC
AW501   SEQ ID No. 15   TCrCATATGCTTATCGATACCC
AW502   SEQ ID No. 16   CATAAGCTTATCGATACXXnT
AW503   SEQ ID No. 17   AAGCTTATGACAGAGACTATAGAGTr
AW504   SEQ ID No. 18   GTGGTCTAGAAGCACGACACX3T



Example 6

Assessment of 3'-5' Exonuclease Activity: A Fidelity Assay

[0105] Because of the dramatic levels of amplification provided by the PCR process (up to 10^ Mo 6 x 10^^-fold),
for certain applications the accuracy of replication (fidelity) is important. PCR fidelity is based on a two step process:
misinsertion and misextension. If the DNA polymerase inserts an incorrect base and the resulting 3'-mismatched
terminus is not extended, this truncated extension product cannot be amplified since the binding site for the downstream
primer is not present. DNA polymerases extend a mismatched 3'-terminus more slowly than a matched 3'-terminus. In
addition, different mismatches extend at disparate rates. See Kwok et al., 1990, Nuc. Acids Res. 18:999-1005, and
Huang et al., 1992, Nuc. Acids Res. 20:4567-4573.

[0106] DNA polymerases with inherent 3' to 5' exonuclease or proofreading activity are able to improve fidelity by
removing misinserted bases before extension. A convenient PCR and restriction endonuclease digestion assay has
been developed to assess the ability of DNA polymerases with 3' to 5' exonuclease activity to remove 3'-terminal
mismatched nucleotides prior to misextension. Several primers were designed which were either perfectly matched or
3'-mismatched (with every possible combination) to the first nucleotide of the BamHI restriction enzyme recognition
sequence in the Thermus aquaticus DNA polymerase gene (Lawyer et al., 1989, J. Biol. Chem. 264:6427-6437 and
U.S. Patent No. 5,079,352). The perfect match primers, FR434 (SEQ ID No. 29) and FR438 (SEQ ID No. 33), amplify
a 151 bp product that is completely digested with BamHI restriction enzyme to generate 132 bp and 19 bp DNA
fragments. The 3'-terminal nucleotide of forward primer FR434 (SEQ ID No. 29) corresponds to nucleotide 1778 of the
Taq DNA pol gene. Forward primers FR435 (SEQ ID No. 30), FR436 (SEQ ID No. 31), and FR437 (SEQ ID No. 32) contain
a single 3' terminal mismatch with respect to the wild-type Taq DNA pol gene and wild-type primer FR438 (SEQ ID No.
33) extension products, corresponding to A:C, T:C, and C:C mismatches, respectively. Any incorrect or misextension
from primers FR435 (SEQ ID No. 30), FR436 (SEQ ID No. 31), or FR437 (SEQ ID No. 32) eliminates the BamHI
recognition site corresponding to nucleotides 1778-1783 of the Taq DNA pol gene. Alternatively, exonucleolytic
proofreading removes the 3' terminal mismatched nucleotides and permits incorporation of the correct dG residue,
resulting in the accumulation of PCR products that now contain the diagnostic BamHI restriction enzyme site. Since all
of the FR435 (SEQ ID No. 30), FR436 (SEQ ID No. 31), or FR437 (SEQ ID No. 32) primers are mismatched to the original
target, this PCR/endonuclease digestion assay requires exonucleolytic proofreading in every cycle to correct the
"mutant" primers and generate a PCR product that contains the diagnostic BamHI cleavage site. Misextension at any
cycle will generate an efficiently copied (now mutant) template in the succeeding cycle (from primer FR438 [SEQ ID No.
33] extension) that is perfectly matched to all of the primers in the assay.



FR434   SEQ ID No. 29   5'-GCACCCCGCTTGGGCAGAG
FR435   SEQ ID No. 30   5'-GCACCCCGCTTGGGCAGAA*
FR436   SEQ ID No. 31   5'-GCACCCCGCTTGGGCAGAT*
FR437   SEQ ID No. 32   5'-GCACCCCGCTTGGGCAGAC*
FR438   SEQ ID No. 33   5'-TCCCGCCCCTCCTGGAAGAC



[0107] Primer FR434 (SEQ ID No. 29) corresponds identically to nucleotides 1760 through 1778 of the Taq DNA
polymerase gene, and primer FR438 (SEQ ID No. 33) is complementary to nucleotides 1891 through 1910 of the Taq
DNA polymerase gene. Primers FR435 (SEQ ID No. 30), FR436 (SEQ ID No. 31), and FR437 (SEQ ID No. 32)
correspond identically to nucleotides 1760 through 1777 of the Taq DNA polymerase gene and contain the indicated (by *,
underlined) 3'-terminal mismatched nucleotide at position 1778.
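
The fragment sizes and mismatch designations of the assay follow from the stated coordinates. The Python sketch below is illustrative only; it assumes that fragment sizes are counted on the top strand and that BamHI cleaves between the first G and the remaining GATCC of its recognition sequence. The constant and function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Coordinates in the Taq DNA polymerase gene, as given in the text.
AMPLICON_START = 1760  # 5' end of forward primer FR434
AMPLICON_END = 1910    # last position covered by reverse primer FR438
BAMHI_FIRST_NT = 1778  # first G of the GGATCC recognition sequence


def amplicon_length() -> int:
    """Length in bp of the product amplified by FR434 and FR438."""
    return AMPLICON_END - AMPLICON_START + 1


def bamhi_fragments() -> tuple:
    """Top-strand fragment sizes, assuming cleavage between G and GATCC."""
    upstream = BAMHI_FIRST_NT - AMPLICON_START + 1
    downstream = AMPLICON_END - BAMHI_FIRST_NT
    return upstream, downstream


def terminal_mismatch(primer_3prime_base: str, correct_base: str = "G") -> str:
    """Primer 3' base written against the template base that pairs with dG."""
    return f"{primer_3prime_base}:{COMPLEMENT[correct_base]}"


print(amplicon_length())
print(bamhi_fragments())
print([terminal_mismatch(base) for base in "ATC"])
```

Under these assumptions the product is 151 bp, BamHI digestion yields 19 bp and 132 bp fragments, and the 3'-terminal A, T, and C of FR435, FR436, and FR437 give A:C, T:C, and C:C mismatches.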

[0108] Recombinant Pab and Poc DNA polymerases were purified from E. coli K12 strain DG116 harboring
plasmids pAW115 or pAW121, respectively. The purification involved cell lysis, heat treatment at 75-85°C, Polymin P
precipitation of bulk nucleic acids, Phenyl Sepharose chromatography and Heparin Sepharose chromatography, according
to Example 9.

[0109] Using this fidelity assay, wild-type recombinant Pab and Poc DNA polymerases are able to correct mismatched
primers FR435 (SEQ ID No. 30), FR436 (SEQ ID No. 31) and FR437 (SEQ ID No. 32) to generate PCR product that
contains the requisite BamHI cleavage site, demonstrating the presence of 3' to 5' exonucleolytic proofreading activity.

Example 7 

Production of 3'-5' exonuclease mutants of Pab pol and Poc pol

[0110] Pab and Poc pol genes lacking 3'-5' exonuclease activity were constructed using site-directed mutagenesis
by overlap extension PCR to alter the codons for Asp187 and Glu189 to code for alanine. Briefly, mutagenesis by
overlap extension PCR involves the generation of DNA fragments that, by virtue of having incorporated complementary
oligo primers in independent PCR reactions, have overlapping ends (see Higuchi et al., 1988, Nuc. Acids Res.
16:7351-7367, and Ho et al., 1989, Gene 77:51-59, which are incorporated herein by reference, for a detailed description
of this method). According to the method, these fragments are combined in a subsequent "fusion" reaction in which the
overlapping ends anneal, allowing the 3' overlap of each strand to serve as a primer for the 3' extension of the
complementary strand. The resulting fusion product is amplified further by PCR. Specific alterations in the nucleotide
sequence can be introduced by incorporating nucleotide changes into the overlapping oligo primers.

[0111] The construction of a 3'-5' exonuclease minus mutant of Pab was accomplished as follows. The two overlapping
primers AW493 (SEQ ID No. 20) and AW494 (SEQ ID No. 21) were designed to span Asp187 and Glu189, in
which both Asp187 and Glu189 are replaced by alanine. The two external primers, AW492 (SEQ ID No. 19) and AW495
(SEQ ID No. 22), were chosen to locate at the unique SpeI and NsiI restriction sites at amino acid position 174-175 and
amino acid position 304-305, respectively, thus making it possible to ligate the fusion product back into the expression
vector. The products from the PCR using primer sets AW492 (SEQ ID No. 19)/AW493 (SEQ ID No. 20) and AW494
(SEQ ID No. 21)/AW495 (SEQ ID No. 22) were 70 bp and 373 bp fragments, respectively. The resulting two fragments
(27 nucleotide 3' overlap) were fused by denaturing and annealing them in a subsequent primer extension reaction. The
416 bp fusion product was amplified further by PCR using the two external primers AW492 (SEQ ID No. 19) and AW495
(SEQ ID No. 22). The mutagenized 416 bp fragment was then cut with SpeI and NsiI and ligated back into the parent
clone pAW115 which had also been digested with SpeI and NsiI. The resulting mutant clone was named pexo-Pab, and
the desired mutations were confirmed by sequence analysis.
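
The size of the fusion product follows from the sizes of the two fragments and the length of their shared overlap. A minimal sketch, with a hypothetical function name:

```python
def fusion_product_length(left_bp: int, right_bp: int, overlap_bp: int) -> int:
    """Length of an overlap extension product: the overlap is counted once."""
    if overlap_bp < 0 or overlap_bp > min(left_bp, right_bp):
        raise ValueError("overlap must fit within both fragments")
    return left_bp + right_bp - overlap_bp


# Fragment sizes and overlap as stated for the Pab exonuclease-minus mutant.
print(fusion_product_length(70, 373, 27))
```

For the 70 bp and 373 bp fragments with a 27 nucleotide overlap this gives 416 bp, in agreement with the size stated above.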

[0112] Similarly, the 3'-5' exonuclease minus mutant of Poc was constructed using the same approach. The
overlapping primer pair used to introduce the mutation is AW489 (SEQ ID No. 24) and AW490 (SEQ ID No. 25). The two
external primers, AW488 (SEQ ID No. 23) and AW491 (SEQ ID No. 26), are located at the unique XbaI and BssHII
restriction sites at amino acid positions 37-39 and 260-262, respectively. The products from PCR using primer sets
AW488 (SEQ ID No. 23)/AW489 (SEQ ID No. 24) and AW490 (SEQ ID No. 25)/AW491 (SEQ ID No. 26) were 476 bp
and 243 bp fragments, respectively. These two fragments were fused and subjected to PCR amplification using the
external primers AW488 (SEQ ID No. 23) and AW491 (SEQ ID No. 26). The mutagenized fragment was then cut with
XbaI and BssHII and ligated back into the parent clone pAW121. The resulting mutant clone was named pexo-Poc.
[0113] The exonuclease activities of the exo-Pab DNA polymerase and exo-Poc DNA polymerase were determined
using the mismatch incorporation proofreading assay. The results showed that both the exo-Pab pol and exo-Poc pol
lacked the 3'-5' exonuclease activity.





AW492   SEQ ID No. 19   5'-TATTGCCGACATAACTAGTATAGA
AW493   SEQ ID No. 20   5'-ACTGTAGACCGCGATCGCGAACGCGAGC
AW494   SEQ ID No. 21   5'-CTCGCGTTCGCGATCGCGGTCTACAGTAAGAGAG
AW495   SEQ ID No. 22   5'-TTATCTCATGCATTTCCTCC
AW488   SEQ ID No. 23   5'-GTGTCGTGCTTCTAGACCA
AW489   SEQ ID No. 24   5'-GCTATACACCGCGATCGCAAAAGCTACCAGC
AW490   SEQ ID No. 25   5'-GGTAGCTTTTGCGATCGCGGTGTATAGCAGGA
AW491   SEQ ID No. 26   5'-TACGGGCGCGCTCCATTAG

Example 8

Thermostability comparison of Pab pol, Poc pol and Taq pol in PCR

[0114] The upper growth temperature of hyperthermophilic genus Pyrodictium is 110°C. To test the thermostability
of purified recombinant Pab pol, Poc pol and Taq pol in the PCR process, the following experiment was performed: 0.1
pg, 1 pg, and 10 pg of M13 DNA (New England Biolabs, Beverly, MA) were used as templates for PCR analysis by Pab,
Poc and Taq. The reactions were subjected to 25, 30, 35 and 40 cycles at denaturing temperatures of 95°C or 100°C. A
PCR product of 350 bp was generated by using BW36 (SEQ ID No. 27) and BW42 (SEQ ID No. 28) as primers.

BW36   SEQ ID No. 27   5'-C(XATAGTTTGAGTTCTTCrACTCAGGC
BW42   SEQ ID No. 28   5'-GAAGAAAGCGAAAGGAGCGGGCGCTAGGGC

PCR was performed at a final concentration of 1 x PCR buffer, 50 µM dNTPs, 0.1 µM each primer, 0.25 units Pab or
0.1 units Poc or 1.25 units Taq in a total reaction volume of 50 µl.

[0115] A unit of Pab DNA polymerase and a unit of Poc DNA polymerase are defined, as for Taq DNA polymerase,
as the amount of enzyme that will incorporate 10 nmoles total dNTPs into acid insoluble material per 30 minutes at
74°C. Poc and Pab DNA polymerases are assayed as described in U.S. Patent No. 4,889,818, which is incorporated
herein by reference, for Taq DNA polymerase with the following changes in reaction components. Pab DNA polymerase:
Tris-HCl pH 8.3 (25°C), 100 mM KCl, 5 mM MgCl2. Poc DNA polymerase: Tris-HCl pH 8.0 (25°C), 10 mM KCl, 5 mM
MgCl2. 1 x PCR buffer for Pab contains: 20 mM Tris-HCl, pH 8.4, 100 mM KCl, 1.5 mM MgCl2. 1 x PCR buffer for Poc
contains: 20 mM Tris-HCl, pH 8.4, 10 mM KCl, 1.0 mM MgCl2. 1 x PCR buffer for Taq contains: 20 mM Tris, pH 8.4, 50
mM KCl, 1.5 mM MgCl2. The amplification profile involved denaturation at 95°C or 100°C for 30 seconds, primer
annealing and extension at 55°C for 30 seconds. The results showed that both Pab pol and Poc pol were extremely
thermoresistant, functioning effectively in the PCR with denaturing temperature up to 100°C. In contrast, Taq pol
produced no product under these conditions at 100°C.
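
The unit definition above can be applied to an assay of different duration by scaling. The following sketch assumes that incorporation is linear with time over the assay period, which the text does not state; the function name is hypothetical.

```python
def polymerase_units(nmol_dntp_incorporated: float, assay_minutes: float) -> float:
    """Units of activity, where one unit incorporates 10 nmol total dNTPs
    into acid insoluble material in 30 minutes at 74°C.

    Assumes incorporation is linear with time (an assumption).
    """
    if assay_minutes <= 0:
        raise ValueError("assay time must be positive")
    return (nmol_dntp_incorporated / 10.0) * (30.0 / assay_minutes)


print(polymerase_units(10.0, 30.0))  # the defining case
print(polymerase_units(5.0, 15.0))   # same rate over a shorter assay
```

Both calls describe the same incorporation rate and therefore return the same number of units.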

Example 9 

Purification of Recombinant Pyrodictium DNA Polymerase

[0116] Recombinant Pyrodictium DNA polymerase is purified as follows. Briefly, cells are thawed in 1 volume of TE
buffer (50 mM Tris-HCl, pH 7.5, and 1.0 mM EDTA with 1 mM DTT), and protease inhibitors are added (PMSF
[phenylmethylsulfonyl fluoride] to 2.4 mM, leupeptin to 1 µg/ml, and TLCK [(-)-1-chloro-3-tosylamido-7-amino-2-heptanone
hydrochloride] to 0.2 mM). The cells are lysed in an Aminco French pressure cell at 20,000 psi and sonicated to reduce
viscosity. The sonicate is diluted with TE buffer and protease inhibitors to 5.5 X wet weight cell mass (Fraction I),
adjusted to 0.2 M ammonium sulfate, and brought rapidly to 85°C and maintained at 85°C for 15 minutes. The
heat-treated supernatant is chilled rapidly to 0°C, and the E. coli cell membranes and denatured proteins are removed
following centrifugation at 20,000 X G for 30 minutes. The supernatant containing Pyrodictium DNA polymerase (Fraction
II) is saved. The level of Polymin P necessary to precipitate >95% of the nucleic acids is determined by trial precipitation
(usually in the range of 0.6 to 1% w/v). The desired amount of Polymin P is added slowly with rapid stirring at 0°C for
30 minutes and the suspension centrifuged at 20,000 X G for 30 minutes to remove the precipitated nucleic acids. The
supernatant (Fraction III) containing the Pyrodictium DNA polymerase is saved.

[0117] Fraction III is adjusted to 0.3 M ammonium sulfate and applied to a Phenyl Sepharose column that has been
equilibrated in 50 mM Tris-HCl, pH 7.5, 0.3 M ammonium sulfate, 10 mM EDTA, and 1 mM DTT. The column is washed
with 2 to 4 column volumes of the same buffer (A280 baseline), and then 1 to 2 column volumes of TE buffer
containing 50 mM KCl to remove most contaminating E. coli proteins. Pyrodictium DNA polymerase is then eluted from the
column with buffer containing 50 mM Tris-HCl, pH 7.5, 2 M urea, 20% (w/v) ethylene glycol, 10 mM EDTA, and 1 mM DTT,
and fractions containing DNA polymerase activity are pooled (Fraction IV).

[0118] Final purification of recombinant Pyrodictium DNA polymerase is achieved using Heparin Sepharose
chromatography, anion exchange chromatography, or Affigel blue chromatography. Recombinant Pyrodictium DNA
polymerase may be diafiltered into 2.5X storage buffer (50 mM Tris-HCl pH 8.0, 250 mM KCl, 2.5 mM DTT, 0.25 mM EDTA,
0.5% [w/v] Tween 20), combined with 1.5 volumes of sterile 80% (w/v) glycerol, and stored at -20°C.
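
The final composition after combining one volume of enzyme in 2.5X storage buffer with 1.5 volumes of 80% glycerol can be worked out as follows. This is a sketch only, assuming the volumes are additive; the names are illustrative.

```python
# 2.5X storage buffer as stated in the text.
STORAGE_BUFFER_2_5X = {
    "Tris-HCl pH 8.0 (mM)": 50.0,
    "KCl (mM)": 250.0,
    "DTT (mM)": 2.5,
    "EDTA (mM)": 0.25,
    "Tween 20 (% w/v)": 0.5,
}
GLYCEROL_STOCK_PERCENT = 80.0
GLYCEROL_VOLUMES = 1.5  # volumes of glycerol per volume of buffer


def final_storage_composition() -> dict:
    """Concentrations after mixing, assuming additive volumes (an assumption)."""
    total_volumes = 1.0 + GLYCEROL_VOLUMES
    final = {name: conc / total_volumes for name, conc in STORAGE_BUFFER_2_5X.items()}
    final["glycerol (% w/v)"] = GLYCEROL_STOCK_PERCENT * GLYCEROL_VOLUMES / total_volumes
    return final


for component, value in final_storage_composition().items():
    print(component, value)
```

Under this assumption the 2.5-fold dilution brings the buffer to 1X (20 mM Tris-HCl, 100 mM KCl, 1 mM DTT, 0.1 mM EDTA, 0.2% Tween 20) in 48% glycerol.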


Example 10 

Thermostability of Pyrodictium occultum DNA polymerase

[0119] The thermal stability of the Pyrodictium occultum DNA polymerase was assessed by measuring the activity
after incubations at 100°C for varying lengths of time. The DNA polymerase was incubated in a mixture intended to
mimic PCR amplification conditions, but chosen such that no DNA synthesis occurred. The enzyme mixture contained
the following reagents:

10 mM Tris-HCl pH 8.0
50 mM KCl
200 µM dATP
1 mM MgCl2
0.1 µg single-stranded DNA
20 pmoles primer (30 base oligomer)

[0120] To measure activity, 5 µl of incubated enzyme mixture was added to 45 µl of reaction mixture consisting of
the following reagents:

10 mM Tris pH 8.0
6 mM MgCl2
75 mM KCl
1 mM beta-mercaptoethanol
200 µM each dATP, dTTP, and dGTP
200 µM [α-32P]dCTP
2.5 µg activated salmon sperm DNA

Activity was measured as the amount of dNMP incorporated in 10 minutes at 75°C. 

[0121] In one experiment, incubations were carried out for 0, 1, 2, and 4 hours. Reactions incubated for less than 4
hours were held on ice until all incubations were completed so that all activity assays were carried out together. The
measured activities are provided below, represented as the fraction of the initial activity remaining after each high
temperature incubation.



Hours    Relative Activity (%)
1        93
2        116
4        104



A similar experiment was carried out using incubations of 0, 1, 2, 3, 4, 6, 7, and 8 hours at 100°C; the results are
provided below.

Hours    Relative Activity (%)
1        86
2        82
3        86
4        67
6        82
7        104
8        104

No detectable loss in activity was observed even after an 8 hour incubation at 100°C. The thermal stability of the DNA
polymerase from Pyrodictium abyssi is expected to be similar.
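
For readers who want to work with the reported numbers, the following sketch tabulates the two experiments and computes simple summary statistics. The relative activity values are the ones reported above; the helper names are hypothetical, and the `relative_activity` formula is the assumed definition (activity after incubation as a percentage of the zero-hour activity), which the text describes only in words.

```python
from statistics import mean


def relative_activity(incorporated, initial):
    """Percent of initial activity remaining (assumed definition)."""
    return 100.0 * incorporated / initial


# Hours at 100°C -> relative activity (%), as reported in the two experiments
experiment_1 = {1: 93, 2: 116, 4: 104}
experiment_2 = {1: 86, 2: 82, 3: 86, 4: 67, 6: 82, 7: 104, 8: 104}


def summarize(results):
    """Mean, minimum and maximum of the reported relative activities."""
    values = list(results.values())
    return {"mean": mean(values), "min": min(values), "max": max(values)}


for label, data in (("Experiment 1", experiment_1), ("Experiment 2", experiment_2)):
    stats = summarize(data)
    print(f"{label}: mean {stats['mean']:.1f}%, range {stats['min']}-{stats['max']}%")
```

The spread of the individual values gives a rough sense of the assay-to-assay variation against which any loss of activity would have to be judged.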

Example 11

Exo-minus Deletion Mutants



[0122] Amino(N)-terminal deletion mutant DNA polymerases were created which lack exonuclease activity while
retaining polymerase activity. Three "mini" Pab DNA polymerases were produced in which 366, 386, and 403 amino
acids were deleted, producing 48, 46, and 44 kilodalton (kDa) proteins, respectively. The mutant polymerase genes
were created and expressed as described below.
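
The stated sizes of the three "mini" polymerases can be checked with a back-of-the-envelope estimate. The sketch below assumes a full-length protein of 803 amino acids (the length given for the polymerase sequences in the sequence listing) and an average residue mass of about 110 Da; the latter is a common rule of thumb and not a value taken from this document. The constant and function names are illustrative.

```python
FULL_LENGTH_AA = 803     # length listed for the polymerase amino acid sequences
AVG_RESIDUE_DA = 110.0   # ASSUMED rule-of-thumb average residue mass


def truncated_length(deleted_residues):
    """Residues in the mutant: the new start Met plus the retained C-terminal part.

    Deleting N residues and starting with Met in place of residue N + 1
    leaves FULL_LENGTH_AA - N residues in total.
    """
    return FULL_LENGTH_AA - deleted_residues


def estimated_kda(deleted_residues):
    """Approximate molecular mass of the mutant in kDa, rounded to an integer."""
    return round(truncated_length(deleted_residues) * AVG_RESIDUE_DA / 1000.0)


for deleted in (366, 386, 403):
    print(deleted, truncated_length(deleted), estimated_kda(deleted))
```

With these assumptions the estimates come out at 48, 46, and 44 kDa for the 366, 386, and 403 amino acid deletions, in line with the sizes stated above.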

[0123] Subsequences of the full length sequence that encodes the Pab DNA polymerase were amplified from
expression plasmid pAW115 using the primers shown below. Each of the upstream primers, AW594, AW593, and
AW576, introduces an ATG start codon, introduces an Nde I restriction site before the ATG start codon, and introduces
some alterations in the first six codons to provide a sequence more compatible with the codon usage of E. coli without
changing the amino acid sequence of the encoded protein. Primer AW594 introduces the ATG start codon between
amino acid positions 367 and 368, resulting in a 366 amino acid deletion mutant. Similarly, primer AW593 introduces
the ATG start codon between amino acid positions 387 and 388, resulting in a 386 amino acid deletion mutant, and
primer AW576 introduces the ATG start codon between amino acid positions 404 and 405, resulting in a 403 amino acid
deletion mutant. A single downstream primer, AW577, which includes an Apa I site corresponding to amino acid
position 454, was used for each amplification. The sequences of the primers are provided below, shown 5' to 3'.

Primer    Seq ID No.    Sequence
AW594     36            TTCGCATATGCXATTTGCAATACAACTITOG^
AW593     37            TTCGCATATGGGTGTAGGTTTTi^TCTAGAATGGTAC
AW576     38            CGCATATGAACX3AACTGG1TCCCAACCGTGTCAAG
AW577     39            GTCAGGGCCCACATTGTACTT

[0124] Each amplification was carried out in a 50 µl reaction volume using 100 pg of linearized pAW115 as template
under the following conditions: 10 pmol each primer; 50 µM each dNTP; 1.5 mM MgCl2; 10 mM Tris-HCl, pH 8.8; 10 mM
KCl; and 1 U UlTma DNA polymerase (Perkin Elmer, Norwalk, CT). The temperature profile for the amplification was 20
cycles each consisting of 95°C for 30 seconds and 55°C for 30 seconds.
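
The primer amount and cycling profile can be translated into more familiar quantities with a few lines of arithmetic. This is a minimal sketch: the function and variable names are hypothetical, and the hold-time figure ignores temperature ramping, which depends on the thermal cycler and is not specified in the text.

```python
def micromolar(pmol, volume_ul):
    """Concentration in µM; pmol per µl is numerically equal to µmol per litre."""
    return pmol / volume_ul


primer_um = micromolar(10, 50)           # 10 pmol of each primer in a 50 µl reaction

cycles = 20
seconds_per_cycle = 30 + 30              # 95°C for 30 s plus 55°C for 30 s
hold_minutes = cycles * seconds_per_cycle / 60

print(f"primer concentration: {primer_um} µM each")
print(f"total programmed hold time: {hold_minutes} min (ramp times excluded)")
```

Each primer is therefore present at 0.2 µM, and the programmed hold times add up to 20 minutes.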

[0125] The amplified products were digested with Nde I and Apa I and purified using agarose gel electrophoresis.
The purified products were subcloned into pAW115 which had been digested with Nde I and Apa I, thereby replacing
the original 1364 base fragment with either the 266, 206, or 155 base amplified inserts. The resulting clones were
named pAW126 (403 amino acid deletion mutant), pAW129 (386 amino acid deletion mutant), and pAW130 (366 amino
acid deletion mutant). The DNA sequences of the replaced fragments were confirmed by DNA sequence analysis.
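
The three insert sizes are mutually consistent with the three deletion sizes: because all inserts share the same downstream end, each additional deleted codon should shorten the insert by three bases. The sketch below checks this using only the numbers quoted above; the dictionary and function names are illustrative.

```python
# Deleted amino acids -> size of the amplified insert in bases (from the text)
INSERT_BASES = {366: 266, 386: 206, 403: 155}


def sizes_consistent(deletion_a, deletion_b):
    """True if the insert size difference equals 3 bases per extra deleted codon."""
    base_difference = INSERT_BASES[deletion_a] - INSERT_BASES[deletion_b]
    return base_difference == 3 * (deletion_b - deletion_a)


for a, b in ((366, 386), (386, 403), (366, 403)):
    print(a, b, sizes_consistent(a, b))
```

For example, the 386 amino acid deletion removes 20 more codons than the 366 amino acid deletion, and the corresponding inserts differ by 266 - 206 = 60 bases.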

[0126] Each of the resulting expression vectors was expressed in E. coli essentially as described in the previous
examples. The expression of the 48 and 46 kDa proteins was moderate, whereas the expression of the 44 kDa protein
was very high. Crude, heat treated extracts of each protein showed polymerase activity using the activity assay
described in Example 10.

ATCC Deposits

[0127] The following bacteriophage and bacterial strains were deposited with the American Type Culture
Collection, 12301 Parklawn Drive, Rockville, Maryland, U.S.A. (ATCC). These deposits were made under the provisions of the
Budapest Treaty on the International Recognition of the Deposit of Microorganisms for purposes of Patent Procedure
and the Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture for 30 years from the
date of deposit. The organisms will be made available by ATCC under the terms of the Budapest Treaty, and subject to
an agreement between Applicants and ATCC that assures unrestricted availability upon issuance of the pertinent U.S.
patent and/or publication of foreign patents or patent applications. Availability of the deposited strains is not to be
construed as a license to practice the invention in contravention of the rights granted under the authority of any government
in accordance with its patent laws.



Deposit Designation    ATCC No.    Date of Deposit
pPab14                 69310       05/11/93
pPoc4                  69309       05/11/93



[0128] The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice
the invention. The present invention is not to be limited in scope by the cell lines deposited, since the deposited
embodiment is intended as a single illustration of one aspect of the invention and any cell lines that are functionally equivalent
are within the scope of this invention. The deposit of materials therein does not constitute an admission that the written
description herein contained is inadequate to enable the practice of any aspect of the invention, including the best mode
thereof, nor are the deposits to be construed as limiting the scope of the claim to the specific illustrations that they
represent. Indeed, various modifications of the invention in addition to those shown and described herein will become
apparent to those skilled in the art from the foregoing description and fall within the scope of the appended claims.



SEQUENCE LISTING



(1) GENERAL INFORMATION:

     (i) APPLICANT:
          (A) NAME: F.Hoffmann-La Roche AG
          (B) STREET: Grenzacherstrasse 124
          (C) CITY: Basel
          (D) STATE: BS
          (E) COUNTRY: Switzerland
          (F) POSTAL CODE (ZIP): CH-4002
          (G) TELEPHONE: (0)61 688 24 03
          (H) TELEFAX: (0)61 688 13 95
          (I) TELEX: 962292/965542 hlr ch

     (ii) TITLE OF INVENTION: Thermostable Nucleic Acid Polymerase

     (iii) NUMBER OF SEQUENCES: 39

     (iv) COMPUTER READABLE FORM:
          (A) MEDIUM TYPE: Floppy disk
          (B) COMPUTER: IBM PC compatible
          (C) OPERATING SYSTEM: PC-DOS/MS-DOS
          (D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

     (vi) PRIOR APPLICATION DATA:
          (A) APPLICATION NUMBER: US 08/062,368
          (B) FILING DATE: 14-MAY-1993

(2) INFORMATION FOR SEQ ID NO: 1:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 2430 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

     (ii) MOLECULE TYPE: DNA (genomic)

     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATGCCAGAAG CTATAGAGTT CGTGCTCCTT GATTCAAGCT ACGAGATTGT AGGGAAAGAG 60
CCGGTAATCA TACTATGGGG TGTAACGCTA GACGGTAAAC GCATAGTCCT ACTTGATAGG 120
AGGTTTAGGC CCTACTTCTA TGCACTCATA TCCCGCGACT ACGAAGGTAA GGCCGAGGAG 180
GTAGTAGCTG CTATTAGAAG GCTAAGTATG GCAAAGAGCC CCATAATAGA AGCAAAGGTG 240
GTTAGTAAGA AGTACTTCGG AAGGCCCCGT AAAGCAGTCA AAGTAACGAC AGTTATACCC 300
GAATCTGTCA GAGAATATAG AGAGGCTGTA AAAAAGCTGG AAGGCGTGGA AGACTCTCTA 360
GAAGCAGACA TAAGGTTCGC GATGAGGTAT CTAATCGACA AGAAGCTCTA CCCGTTCACA 420
GCATACCGTG TCAGAGCCGA GAACGCTGGA CGCAGCCCTG GTTTCCGTGT AGACTCGGTA 480
TACACTATAG TTGAGGACCC AGAGCCTATT GCCGACATAA CTAGTATAGA TATACCAGAG 540
ATGCGTGTGC TCGCGTTCGA CATAGAGGTC TACAGTAAGA GAGGAAGCCC TAACCCGTCC 600
CGCGACCCGG TCATAATAAT CTCGATAAAG GACAGCAAGG GGAACGAGAA GCTACTAGAA 660
GCCAATAACT ACGACGACAG AAACGTGCTA CGGGAATTTA TAGAGTACAT ACGCTCCTTT 720
GACCCAGACA TAATAGTAGG CTACAATAGC AACAATTTTG ACTGGCCATA CCTTATAGAA 780
CGTGCACACA GAATAGGAGT AAAGCTCGAC GTGACAAGGC GTGTTGGCGC AGAGCCAAGT 840
ATGAGCGTCT ATGGACATGT CTCAGTGCAG GGTAGGCTAA ACGTAGACCT CTACAACTAC 900
GTGGAGGAAA TGCATGAGAT AAAGGTAAAG ACGCTCGAGG AGGTCGCCGA ATACCTAGGC 960
GTTATGCGCA AGAGCGAGCG CGTACTAATA GAATGGTGGC GGATCCCAGA TTACTGGGAC 1020
GACGAGAAGA AACGGCCGCT ACTGAAGCGT TATGCCCTCG ACGATGTGAG AGCCACCTAC 1080
GGCCTCGCCG AGAAGATACT CCCATTCGCA ATACAGCTTT CGACAGTAAC CGGTGTTCCT 1140
TTAGACCAAG TCGGGGCTAT GGGCGTAGGT TTCCGTCTAG AATGGTACCT TATGAGAGCA 1200
GCGCATGATA TGAACGAGCT TGTCCCCAAC CGTGTCAAGC GGCGCGAAGA GAGCTACAAG 1260
GGAGCAGTAG TACTAAAGCC CCTAAAGGGT GTCCATGAGA ACGTAGTAGT GCTCGACTTT 1320
AGCTCAATGT ACCCCAACAT AATGATAAAG TACAATGTGG GCCCTGACAC GATAATTGAC 1380
GACCCCTCAG AGTGCGAGAA GTACAGTGGA TGCTACGTAG CCCCCGAAGT CGGGCACATG 1440
TTTAGGCGCT CGCCCTCCGG CTTCTTTAAG ACCGTGCTTG AGAACCTCAT AGCGCTGCGT 1500
AAGCAAGTAC GTGAAAAGAT GAAGGAGTTC CCCCCAGATA GCCCAGAATA CCGGATATAC 1560
GATGAACGCC AGAAGGCACT CAAGGTGCTA GCCAACGCTA GCTACGGCTA CATGGGATGG 1620
GTGCACGCTC GCTGGTACTG TAAACGCTGC GCAGAGGCTG TAACAGCCTG GGGCCGTAAC 1680
CTGATACTCT CAGCAATAGA ATATGCTAGG AAGCTCGGCC TCAAAGTAAT ATACGGAGAC 1740
ACGGACTCCC TATTCGTAAC CTATGATATL                                  1800
GAGAAACAGC TAGGCTTCGA GATAAAGATA GACAAGGTAT ACAAAAGAGT GTTCTTTACC 1860
GAGGCAAAGA AGCGCTACGT GGGCCTCCTC GAGGACGGGC GTATGGACAT AGTAGGCTTT 1920
GAGGCTGTTA GAGGCGACTG GTGTGAGCTA GCTAAAGAGG TGCAAGAGAA AGTAGCAGAG 1980
ATAATACTGA AGACGGGAGA CATAAATAGA GCCATAAGCT ACATAAGAGA GGTCGTGAGA 2040
AAGCTAAGAG AAGGCAAGAT ACCCATAACA AAGCTCGTAA TATGGAAGAC CTTGACAAAG 2100
AGAATCGAGG AATACGAGCA CGAGGCGCCG CACGTTACTG CAGCACGGCG TATGAAAGAA 2160
GCAGGCTACG ATGTGGCACC GGGAGACAAG ATAGGCTACA TCATAGTTAA AGGACATGGC 2220
AGTATATCGA GTCGTGCCTA CCCGTACTTT ATGGTAGACT CGTCTAAGGT TGACACAGAG 2280
TACTACATAG ACCACCAGAT AGTACCAGCA GCAATGAGGA TACTCTCATA CTTCGGGGTC 2340
ACAGAGAAGC AGCTTAAGGC AGCATCATCT GGGCATAGGA GTCTCTTCGA CTTCTTCGCG 2400
GCAAAGAAGT AGCCCCGGCT CTCCAAACTA 2430

(2) INFORMATION FOR SEQ ID NO: 2:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 803 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

     (ii) MOLECULE TYPE: DNA (genomic)

     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Pro Glu Ala Ile Glu Phe Val Leu Leu Asp Ser Ser Tyr Glu Ile
1               5                   10                  15

Val Gly Lys Glu Pro Val Ile Ile Leu Trp Gly Val Thr Leu Asp Gly
            20                  25                  30

Lys Arg Ile Val Leu Leu Asp Arg Arg Phe Arg Pro Tyr Phe Tyr Ala
        35                  40                  45

Leu Ile Ser Arg Asp Tyr Glu Gly Lys Ala Glu Glu Val Val Ala Ala
    50                  55                  60

Ile Arg Arg Leu Ser Met Ala Lys Ser Pro Ile Ile Glu Ala Lys Val
65                  70                  75                  80

Val Ser Lys Lys Tyr Phe Gly Arg Pro Arg Lys Ala Val Lys Val Thr
                85                  90                  95

Thr Val Ile Pro Glu Ser Val Arg Glu Tyr Arg Glu Ala Val Lys Lys
            100                 105                 110

Leu Glu Gly Val Glu Asp Ser Leu Glu Ala Asp Ile Arg Phe Ala Met
        115                 120                 125

Arg Tyr Leu Ile Asp Lys Lys Leu Tyr Pro Phe Thr Ala Tyr Arg Val
    130                 135                 140

Arg Ala Glu Asn Ala Gly Arg Ser Pro Gly Phe Arg Val Asp Ser Val
145                 150                 155                 160

Tyr Thr Ile Val Glu Asp Pro Glu Pro Ile Ala Asp Ile Thr Ser Ile
                165                 170                 175

Asp Ile Pro Glu Met Arg Val Leu Ala Phe Asp Ile Glu Val Tyr Ser
            180                 185                 190

Lys Arg Gly Ser Pro Asn Pro Ser Arg Asp Pro Val Ile Ile Ile Ser
        195                 200                 205

Ile Lys Asp Ser Lys Gly Asn Glu Lys Leu Leu Glu Ala Asn Asn Tyr
    210                 215                 220

Asp Asp Arg Asn Val Leu Arg Glu Phe Ile Glu Tyr Ile Arg Ser Phe
225                 230                 235                 240

Asp Pro Asp Ile Ile Val Gly Tyr Asn Ser Asn Asn Phe Asp Trp Pro
                245                 250                 255

Tyr Leu Ile Glu Arg Ala His Arg Ile Gly Val Lys Leu Asp Val Thr
            260                 265                 270

Arg Arg Val Gly Ala Glu Pro Ser Met Ser Val Tyr Gly His Val Ser
        275                 280                 285

Val Gln Gly Arg Leu Asn Val Asp Leu Tyr Asn Tyr Val Glu Glu Met
    290                 295                 300

His Glu Ile Lys Val Lys Thr Leu Glu Glu Val Ala Glu Tyr Leu Gly
305                 310                 315                 320

Val Met Arg Lys Ser Glu Arg Val Leu Ile Glu Trp Trp Arg Ile Pro
                325                 330                 335

Asp Tyr Trp Asp Asp Glu Lys Lys Arg Pro Leu Leu Lys Arg Tyr Ala
            340                 345                 350

Leu Asp Asp Val Arg Ala Thr Tyr Gly Leu Ala Glu Lys Ile Leu Pro
        355                 360                 365

Phe Ala Ile Gln Leu Ser Thr Val Thr Gly Val Pro Leu Asp Gln Val
    370                 375                 380

Gly Ala Met Gly Val Gly Phe Arg Leu Glu Trp Tyr Leu Met Arg Ala
385                 390                 395                 400

Ala His Asp Met Asn Glu Leu Val Pro Asn Arg Val Lys Arg Arg Glu
                405                 410                 415

Glu Ser Tyr Lys Gly Ala Val Val Leu Lys Pro Leu Lys Gly Val His
            420                 425                 430

Glu Asn Val Val Val Leu Asp Phe Ser Ser Met Tyr Pro Asn Ile Met
        435                 440                 445

Ile Lys Tyr Asn Val Gly Pro Asp Thr Ile Ile Asp Asp Pro Ser Glu
    450                 455                 460

Cys Glu Lys Tyr Ser Gly Cys Tyr Val Ala Pro Glu Val Gly His Met
465                 470                 475                 480

Phe Arg Arg Ser Pro Ser Gly Phe Phe Lys Thr Val Leu Glu Asn Leu
                485                 490                 495

Ile Ala Leu Arg Lys Gln Val Arg Glu Lys Met Lys Glu Phe Pro Pro
            500                 505                 510

Asp Ser Pro Glu Tyr Arg Ile Tyr Asp Glu Arg Gln Lys Ala Leu Lys
        515                 520                 525

Val Leu Ala Asn Ala Ser Tyr Gly Tyr Met Gly Trp Val His Ala Arg
    530                 535                 540

Trp Tyr Cys Lys Arg Cys Ala Glu Ala Val Thr Ala Trp Gly Arg Asn
545                 550                 555                 560

Leu Ile Leu Ser Ala Ile Glu Tyr Ala Arg Lys Leu Gly Leu Lys Val
                565                 570                 575

Ile Tyr Gly Asp Thr Asp Ser Leu Phe Val Thr Tyr Asp Ile Glu Lys
            580                 585                 590

Val Lys Lys Leu Ile Glu Phe Val Glu Lys Gln Leu Gly Phe Glu Ile
        595                 600                 605

Lys Ile Asp Lys Val Tyr Lys Arg Val Phe Phe Thr Glu Ala Lys Lys
    610                 615                 620

Arg Tyr Val Gly Leu Leu Glu Asp Gly Arg Met Asp Ile Val Gly Phe
625                 630                 635                 640

Glu Ala Val Arg Gly Asp Trp Cys Glu Leu Ala Lys Glu Val Gln Glu
                645                 650                 655

Lys Val Ala Glu Ile Ile Leu Lys Thr Gly Asp Ile Asn Arg Ala Ile
            660                 665                 670

Ser Tyr Ile Arg Glu Val Val Arg Lys Leu Arg Glu Gly Lys Ile Pro
        675                 680                 685

Ile Thr Lys Leu Val Ile Trp Lys Thr Leu Thr Lys Arg Ile Glu Glu
    690                 695                 700

Tyr Glu His Glu Ala Pro His Val Thr Ala Ala Arg Arg Met Lys Glu
705                 710                 715                 720

Ala Gly Tyr Asp Val Ala Pro Gly Asp Lys Ile Gly Tyr Ile Ile Val
                725                 730                 735

Lys Gly His Gly Ser Ile Ser Ser Arg Ala Tyr Pro Tyr Phe Met Val
            740                 745                 750

Asp Ser Ser Lys Val Asp Thr Glu Tyr Tyr Ile Asp His Gln Ile Val
        755                 760                 765

Pro Ala Ala Met Arg Ile Leu Ser Tyr Phe Gly Val Thr Glu Lys Gln
    770                 775                 780

Leu Lys Ala Ala Ser Ser Gly His Arg Ser Leu Phe Asp Phe Phe Ala
785                 790                 795                 800

Ala Lys Lys



(2) INFORMATION FOR SEQ ID NO: 3:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 2430 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

     (ii) MOLECULE TYPE: DNA (genomic)

     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ATGACAGAGA CTATAGAGTT CGTGCTGCTA GACTCTAGCT ACGAGATACT GGGGAAGGAG 60
CCGGTAGTAA TCCTCTGGGG GATAACGCTT GACGGTAAAC GTGTCGTGCT TCTAGACCAC 120
CGCTTCCGCC CCTACTTCTA CGCCCTCATA GCCCGGGGCT ATGAGGATAT GGTGGAGGAG 180
ATAGCAGCTT CCATAAGGAG GCTTAGTGTG GTCAAGAGTC CGATAATAGA TGCCAAGCCT 240
CTTGATAAGA GGTACTTCGG CAGGCCCCGT AAGGCGGTGA AGATTACCAC TATGATACCC 300
GAGTCTGTTA GACACTACCG CGAGGCGGTG AAGAAGATAG AGGGTGTGGA GGACTCCCTC 360
GAGGCAGATA TAAGGTTTGC AATGAGATAT CTGATAGATA AGAGGCTCTA CCCGTTCACG 420
GTTTACCGGA TCCCCGTAGA GGATGCGGGC CGCAATCCAG GCTTCCGTGT TGACCGTGTC 480
TACAAGGTTG CTGGCGACCC GGAGCCCCTA GCGGATATAA CGCGGATCGA CCTTCCCCCG 540
ATGAGGCTGG TAGCTTTTGA TATAGAGGTG TATAGCAGGA GGGGGAGCCC TAACCCTGCA 600
AGGGATCCAG TGATAATAGT GTCGCTGAGG GACAGCGAGG GCAAGGAGAG GCTCATAGAA 660
GCTGAAGGCC ATGACGACAG GAGGGTTCTG AGGGAGTTCG TAGAGTACGT GAGAGCCTTC 720
GACCCCGACA TAATAGTGGG CTATAACAGT AACCACTTCG ACTGGCCCTA CCTAATGGAG 780
CGCGCCCGTA GGCTCGGGAT TAACCTCGAC GTTACACGCC GTGTGGGGGC AGAGCCCACC 840
ACCAGCGTCT ACGGCCACGT CTCGGTGCAG GGTAGGCTGA ACGTGGACCT CTACGACTAT 900
GCCGAGGAGA TGCCGGAGAT AAAGATGAAG ACGCTTGAGG AGGTAGCGGA GTACCTAGGC 960
GTTATGAAGA AGAGCGAGCG TGTGATAATA GAGTGGTGGA GGATACCCGA GTACTGGGAT 1020
GACGAGAAGA AGAGGCAGCT GCTAGAGCGC TACGCGCTCG ACGATGTGAG GGCTACCTAC 1080
GGCCTCGCGG AAAAGATGCT ACCGTTCGCC ATACAGCTCT CCACTGTTAC GGGTGTGCCT 1140
CTCGACCAGG TAGGTGCTAT GGGCGTAGGC TTCCGCCTAG AGTGGTATCT CATGCGTGCA 1200
GCCTACGATA TGAACGAGCT GGTGCCGAAC CGGGTGGAGA GGAGGGGGGA GAGCTACAAG 1260
GGTGCAGTAG TGTTAAAGCC TCTCAAGGGA GTCCATGAGA ATGTTGTGGT GCTCGATTTC 1320
AGTTCCATGT ACCCGAGCAT AATGATAAAG TACAACGTGG GCCCCGACAC TATAGTCGAC 1380
GACCCCTCGG AGTGCCCAAA GTACGGCGGC TGCTATGTAG CCCCCGAGGT CGGGCACCGG 1440
TTCCGTCGCT CCCCGCCAGG CTTCTTCAAG ACCGTGCTCG AGAACCTACT GAAGCTACGC 1500
CGACAGGTAA AGGAGAAGAT GAAGGAGTTT CCGCCTGACA GCCCCGAGTA CAGGCTCTAC 1560
GATGAGCGCC AGAAGGCGCT CAAGGTTCTT GCGAACGCGA GCTATGGCTA CATGGGGTGG 1620
AGCCATGCCC GCTGGTACTG CAAACGCTGC GCCGAGGCTG TCACAGCCTG GGGCCGTAAC 1680
CTTATACTGA CAGCTATCGA GTATGCCAGG AAGCTCGGCC TAAAGGTTAT ATATGGAGAC 1740
ACCGACTCCC TCTTCGTGGT CTATGACAAG GAGAAGGTTG AGAAGCTGAT AGAGTTTGTC 1800
GAGAAGGAGC TGGGCTTTGA GATAAAGATA GACAAGATCT ACAAGAAAGT GTTCTTCACG 1860
GAGGCTAAGA AGCGCTATGT AGGTCTCCTC GAGGACGGAC GTATAGACAT CGTGGGCTTT 1920
GAAGCAGTCC GCGGCGACTG GTGCGAGCTG GCTAAGGAGG TGCAGGAGAA GGCGGCTGAG 1980
ATAGTGTTGA ATACGGGGAA CGTGGACAAG GCTATAAGCT ACATAAGGGA GGTAATAAAG 2040
CAGCTCCGCG AGGGCAAGGT GCCAATAACA AAGCTTATCA TATGGAAGAC GCTGAGCAAG 2100
AGGATAGAGG AGTACGAGCA TGACGCGCCT CATGTGATGG CTGCACGGCG TATGAAGGAG 2160
GCAGGCTACG AGGTGTCTCC CGGCGATAAG GTGGGCTACG TCATAGTTAA GGGTAGCGGG 2220
AGTGTGTCCA GCAGGGCCTA CCCCTACTTC ATGGTTGATC CATCGACCAT CGACGTCAAC 2280
TACTATATTG ACCACCAGAT AGTGCCGGCT GCTCTGAGGA TACTCTCCTA CTTCGGAGTC 2340
ACCGAGAAAC AGCTCAAGGC GGCGGCTACG GTGCAGAGAA GCCTCTTCGA CTTCTTCGCC 2400
TCAAAGAAAT AGCTCCTCCA CCCGGCTAGC 2430

(2) INFORMATION FOR SEQ ID NO: 4:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 803 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

     (ii) MOLECULE TYPE: DNA (genomic)

     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Thr Glu Thr Ile Glu Phe Val Leu Leu Asp Ser Ser Tyr Glu Ile
1               5                   10                  15

Leu Gly Lys Glu Pro Val Val Ile Leu Trp Gly Ile Thr Leu Asp Gly
            20                  25                  30

Lys Arg Val Val Leu Leu Asp His Arg Phe Arg Pro Tyr Phe Tyr Ala
        35                  40                  45

Leu Ile Ala Arg Gly Tyr Glu Asp Met Val Glu Glu Ile Ala Ala Ser
    50                  55                  60

Ile Arg Arg Leu Ser Val Val Lys Ser Pro Ile Ile Asp Ala Lys Pro
65                  70                  75                  80

Leu Asp Lys Arg Tyr Phe Gly Arg Pro Arg Lys Ala Val Lys Ile Thr
                85                  90                  95

Thr Met Ile Pro Glu Ser Val Arg His Tyr Arg Glu Ala Val Lys Lys
            100                 105                 110

Ile Glu Gly Val Glu Asp Ser Leu Glu Ala Asp Ile Arg Phe Ala Met
        115                 120                 125

Arg Tyr Leu Ile Asp Lys Arg Leu Tyr Pro Phe Thr Val Tyr Arg Ile
    130                 135                 140

Pro Val Glu Asp Ala Gly Arg Asn Pro Gly Phe Arg Val Asp Arg Val
145                 150                 155                 160

Tyr Lys Val Ala Gly Asp Pro Glu Pro Leu Ala Asp Ile Thr Arg Ile
                165                 170                 175

Asp Leu Pro Pro Met Arg Leu Val Ala Phe Asp Ile Glu Val Tyr Ser
            180                 185                 190

Arg Arg Gly Ser Pro Asn Pro Ala Arg Asp Pro Val Ile Ile Val Ser
        195                 200                 205

Leu Arg Asp Ser Glu Gly Lys Glu Arg Leu Ile Glu Ala Glu Gly His
    210                 215                 220

Asp Asp Arg Arg Val Leu Arg Glu Phe Val Glu Tyr Val Arg Ala Phe
225                 230                 235                 240

Asp Pro Asp Ile Ile Val Gly Tyr Asn Ser Asn His Phe Asp Trp Pro
                245                 250                 255

Tyr Leu Met Glu Arg Ala Arg Arg Leu Gly Ile Asn Leu Asp Val Thr
            260                 265                 270

Arg Arg Val Gly Ala Glu Pro Thr Thr Ser Val Tyr Gly His Val Ser
        275                 280                 285

Val Gln Gly Arg Leu Asn Val Asp Leu Tyr Asp Tyr Ala Glu Glu Met
    290                 295                 300

Pro Glu Ile Lys Met Lys Thr Leu Glu Glu Val Ala Glu Tyr Leu Gly
305                 310                 315                 320

Val Met Lys Lys Ser Glu Arg Val Ile Ile Glu Trp Trp Arg Ile Pro
                325                 330                 335

Glu Tyr Trp Asp Asp Glu Lys Lys Arg Gln Leu Leu Glu Arg Tyr Ala
            340                 345                 350

Leu Asp Asp Val Arg Ala Thr Tyr Gly Leu Ala Glu Lys Met Leu Pro
        355                 360                 365

Phe Ala Ile Gln Leu Ser Thr Val Thr Gly Val Pro Leu Asp Gln Val
    370                 375                 380

Gly Ala Met Gly Val Gly Phe Arg Leu Glu Trp Tyr Leu Met Arg Ala
385                 390                 395                 400

Ala Tyr Asp Met Asn Glu Leu Val Pro Asn Arg Val Glu Arg Arg Gly
                405                 410                 415

Glu Ser Tyr Lys Gly Ala Val Val Leu Lys Pro Leu Lys Gly Val His
            420                 425                 430

Glu Asn Val Val Val Leu Asp Phe Ser Ser Met Tyr Pro Ser Ile Met
        435                 440                 445

Ile Lys Tyr Asn Val Gly Pro Asp Thr Ile Val Asp Asp Pro Ser Glu
    450                 455                 460

Cys Pro Lys Tyr Gly Gly Cys Tyr Val Ala Pro Glu Val Gly His Arg
465                 470                 475                 480

Phe Arg Arg Ser Pro Pro Gly Phe Phe Lys Thr Val Leu Glu Asn Leu
                485                 490                 495

Leu Lys Leu Arg Arg Gln Val Lys Glu Lys Met Lys Glu Phe Pro Pro
            500                 505                 510

Asp Ser Pro Glu Tyr Arg Leu Tyr Asp Glu Arg Gln Lys Ala Leu Lys
        515                 520                 525

Val Leu Ala Asn Ala Ser Tyr Gly Tyr Met Gly Trp Ser His Ala Arg
    530                 535                 540

Trp Tyr Cys Lys Arg Cys Ala Glu Ala Val Thr Ala Trp Gly Arg Asn
545                 550                 555                 560

Leu Ile Leu Thr Ala Ile Glu Tyr Ala Arg Lys Leu Gly Leu Lys Val
                565                 570                 575

Ile Tyr Gly Asp Thr Asp Ser Leu Phe Val Val Tyr Asp Lys Glu Lys
            580                 585                 590

Val Glu Lys Leu Ile Glu Phe Val Glu Lys Glu Leu Gly Phe Glu Ile
        595                 600                 605

Lys Ile Asp Lys Ile Tyr Lys Lys Val Phe Phe Thr Glu Ala Lys Lys
    610                 615                 620

Arg Tyr Val Gly Leu Leu Glu Asp Gly Arg Ile Asp Ile Val Gly Phe
625                 630                 635                 640

Glu Ala Val Arg Gly Asp Trp Cys Glu Leu Ala Lys Glu Val Gln Glu
                645                 650                 655

Lys Ala Ala Glu Ile Val Leu Asn Thr Gly Asn Val Asp Lys Ala Ile
            660                 665                 670

Ser Tyr Ile Arg Glu Val Ile Lys Gln Leu Arg Glu Gly Lys Val Pro
        675                 680                 685

Ile Thr Lys Leu Ile Ile Trp Lys Thr Leu Ser Lys Arg Ile Glu Glu
    690                 695                 700

Tyr Glu His Asp Ala Pro His Val Met Ala Ala Arg Arg Met Lys Glu
705                 710                 715                 720

Ala Gly Tyr Glu Val Ser Pro Gly Asp Lys Val Gly Tyr Val Ile Val
                725                 730                 735

Lys Gly Ser Gly Ser Val Ser Ser Arg Ala Tyr Pro Tyr Phe Met Val
            740                 745                 750

Asp Pro Ser Thr Ile Asp Val Asn Tyr Tyr Ile Asp His Gln Ile Val
        755                 760                 765

Pro Ala Ala Leu Arg Ile Leu Ser Tyr Phe Gly Val Thr Glu Lys Gln
    770                 775                 780

Leu Lys Ala Ala Ala Thr Val Gln Arg Ser Leu Phe Asp Phe Phe Ala
785                 790                 795                 800

Ser Lys Lys



(2) INFORMATION FOR SEQ ID NO: 5:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 36 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GGACCCATAT GCCAGAAGCT ATTGAATTCG TGCTCC 36

(2) INFORMATION FOR SEQ ID NO: 6:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 34 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GGCAGGTACC ACTAGTTATG TCGGCAATAG GCTC 34

(2) INFORMATION FOR SEQ ID NO: 7:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 60 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
TTAAGGCAGC ATCATCTGGG CATAGGAGTC TCTTCGACTT CTTCGCGGCA AAGAAGTAAC 60

(2) INFORMATION FOR SEQ ID NO: 8:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 60 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CCGGGTTACT TCTTTGCCGC GAAGAAGTCG AAGAGACTCC TATGCCCAGA TGATGCTGCC 60

(2) INFORMATION FOR SEQ ID NO: 9:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 22 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GCTTATAGCC TTGTCCACGT TC 22

(2) INFORMATION FOR SEQ ID NO: 10:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 36 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GGACCATGCA TGACTGAAAC TATTGAATTC GTGCTG 36

(2) INFORMATION FOR SEQ ID NO: 11:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 35 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GGAAGGTACC TGATCATCTA GAAGCACGAC ACGTT 35

(2) INFORMATION FOR SEQ ID NO: 12:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 24 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GGAAGCTGAG CAAGAGGATA GAGG 24

(2) INFORMATION FOR SEQ ID NO: 13:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 32 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
GGAAGGTACC TTATTTCTTT GAGGCGAAGA AG 32

(2) INFORMATION FOR SEQ ID NO: 14:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 22 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
TTTTTCGAAA GAAGAAAAAA CC 22

(2) INFORMATION FOR SEQ ID NO: 15:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 22 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TCTCATATGC TTATCGATAC CC 22

(2) INFORMATION FOR SEQ ID NO: 16:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 21 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
     (ii) MOLECULE TYPE: DNA (genomic)
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CATAAGCTTA TCGATACCCT T 21

(2) INFORMATION FOR SEQ ID NO: 17:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 26 base pairs
          (B) TYPE: nucleic acid
EP 0 624 641 B1 



(C) STRANDEDKESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAGCTTATGA CAGAGACTAT AGAGTT 

w 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

^5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTGGTCTAGA AGCACGACAC GT 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
35 TATTGCCGAC ATAACTAGTA TAGA 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
ACTGTAGACC GCGATCGCGA ACGCGAGC 28 
(?) INFORMATION FOR SEQ ID NO: 21: 
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P5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
CTCGCGTTCG CGATCGCGGT CTACAGTAAG AGAG 34 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TTATCTCATG CATTTCCTCC 20 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTGTCGTGCT TCTAGACCA 19 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 4 : 
GCTATACACC GCGATCGCAA AAGCTACCAG C 31 
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(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGTAGCTTTT GCGATCGCGG TGTATAGCAG GA 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 : 
TACGGGCGCG CTCCATTAG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCGATAGTTT GAGTTCTTCT ACTCAGGC 28 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GAAGAAAGCG AAAGGAGCGG GCGCTAGGGC 30 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GCACCCCGCT TGGGCAGAG 19 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
GCACCCCGCT TGGGCAGAA 19 
(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GCACCCCGCT TGGGCAGAT 19 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
GCACCCCGCT TGGGCAGAC 19
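
SEQ ID NOs: 29 to 32 are all 19 nucleotides long and differ only in their final base. The short Python sketch below, which is an illustration and not part of the original listing, makes that relationship explicit by computing the shared prefix and the set of 3'-terminal bases. The function name `common_prefix` is an assumption introduced here, and this part of the document does not say what the four oligonucleotides are used for.

```python
# Sequences copied from the listing, with the grouping spaces removed.
PRIMERS = {
    29: "GCACCCCGCTTGGGCAGAG",
    30: "GCACCCCGCTTGGGCAGAA",
    31: "GCACCCCGCTTGGGCAGAT",
    32: "GCACCCCGCTTGGGCAGAC",
}


def common_prefix(seqs):
    """Return the longest prefix shared by every sequence in seqs."""
    prefix = []
    for column in zip(*seqs):
        # Stop at the first position where the sequences disagree.
        if len(set(column)) != 1:
            break
        prefix.append(column[0])
    return "".join(prefix)


shared = common_prefix(PRIMERS.values())
three_prime_bases = {seq_id: seq[-1] for seq_id, seq in PRIMERS.items()}

print(shared, len(shared))
print(three_prime_bases)
```

Together the four entries cover all four possible bases at the 3' terminus on an otherwise identical 18-nucleotide sequence.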
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
TCCCGCCCCT CCTGGAAGAC 20
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GATAAAGATA GACAAGGTAT AC 22
(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
CGTATTCCTC GATTCTCTTT 20
(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
TTCGCATATG CCATTTGCAA TACAACTTTC GACAGTAACC 40

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
TTCGCATATG GGTGTAGGTT TTCGTCTAGA ATGGTAC 37 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CGCATATGAA CGAACTGGTT CCCAACCGTG TCAAG 35 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
GTCAGGGCCC ACATTGTACT T 21 
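
Each entry in the listing declares a LENGTH and then gives the sequence in groups of ten bases followed by a running count. As a consistency check, the following Python sketch (an illustration under stated assumptions, not part of the original listing) parses a sequence line of that shape and compares the actual base count with the declared length. The function name `parse_sequence_line` and the choice of entries are assumptions made for the example; the sequences and declared lengths are copied from the listing.

```python
# (declared LENGTH, sequence line as printed) for a few entries of the listing.
ENTRIES = {
    21: (34, "CTCGCGTTCG CGATCGCGGT CTACAGTAAG AGAG 34"),
    22: (20, "TTATCTCATG CATTTCCTCC 20"),
    23: (19, "GTGTCGTGCT TCTAGACCA 19"),
    27: (28, "CCGATAGTTT GAGTTCTTCT ACTCAGGC 28"),
    36: (40, "TTCGCATATG CCATTTGCAA TACAACTTTC GACAGTAACC 40"),
    39: (21, "GTCAGGGCCC ACATTGTACT T 21"),
}


def parse_sequence_line(line: str):
    """Split a printed sequence line into (sequence, trailing count or None)."""
    tokens = line.split()
    count = None
    if tokens and tokens[-1].isdigit():
        # The last token is the running base count, not part of the sequence.
        count = int(tokens.pop())
    seq = "".join(tokens).upper()
    if set(seq) - set("ACGT"):
        raise ValueError(f"unexpected characters in sequence: {seq!r}")
    return seq, count


def check_entry(declared_length: int, line: str) -> bool:
    """True if the base count matches the declared length and any trailing count."""
    seq, count = parse_sequence_line(line)
    return len(seq) == declared_length and count in (None, declared_length)


results = {seq_id: check_entry(length, line)
           for seq_id, (length, line) in ENTRIES.items()}
print(results)
```

A line whose trailing count was lost in reproduction still passes as long as the number of bases agrees with the LENGTH field, because the count is treated as optional.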



Claims 

1. A thermostable DNA polymerase having the amino acid sequence from amino terminus to carboxy terminus of 
SEQ ID NO: 2 or SEQ ID NO: 4.

2. An N-terminal shortened fragment of the polymerase as claimed in claim 1 having the amino acid sequence of SEQ
ID NO: 2 but lacking up to one third of the amino terminal sequence of the amino acid sequence SEQ ID NO: 2. 



3. An N-terminal shortened fragment of the polymerase as claimed in claim 2 lacking exonuclease activity, preferably
an N-terminal shortened fragment selected from the group of

(a) the 366 amino acid deletion mutant having the amino acid sequence from position 1, which is methionine, 
immediately followed by amino acids 368 through 803 of SEQ ID NO: 2; 

(b) the 386 amino acid deletion mutant having the amino acid sequence from position 1, which is methionine, 
immediately followed by amino acids 388 through 803 of SEQ ID NO: 2; or

(c) the 403 amino acid deletion mutant having the amino acid sequence from position 1, which is methionine, 
immediately followed by amino acids 405 through 803 of SEQ ID NO: 2.
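
The naming of the three deletion mutants in claim 3 can be checked with a little arithmetic. The Python sketch below is an illustration only, not the method of the specification. It assumes SEQ ID NO: 2 is 803 residues long, which is inferred from "through 803" in the claim, and it uses a placeholder string in place of the real sequence, which is not reproduced in this section. The function name `n_terminal_deletion` is hypothetical.

```python
# Assumed length of SEQ ID NO: 2, inferred from "through 803" in claim 3.
FULL_LENGTH = 803


def n_terminal_deletion(full_seq: str, first_retained: int) -> str:
    """Return methionine followed by residues first_retained..end (1-based)."""
    if not 2 <= first_retained <= len(full_seq):
        raise ValueError("first_retained must lie inside the sequence")
    # Position 1 of the fragment is Met; the rest is the retained C-terminal part.
    return "M" + full_seq[first_retained - 1:]


# Placeholder only: a stand-in of the right length, NOT the real SEQ ID NO: 2.
placeholder = "M" + "X" * (FULL_LENGTH - 1)

summary = {}
for first_retained in (368, 388, 405):
    fragment = n_terminal_deletion(placeholder, first_retained)
    # Net number of residues by which the fragment is shorter than the full enzyme.
    summary[first_retained] = (len(fragment), FULL_LENGTH - len(fragment))

print(summary)
```

Under these assumptions the fragments are 437, 417 and 400 residues long, that is, 366, 386 and 403 residues shorter than the full-length enzyme, which agrees with the names used in (a), (b) and (c).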

4. An N-terminal shortened fragment of the polymerase as claimed in claim 1 having the amino acid sequence of SEQ
ID NO: 4 but lacking up to one third of the amino terminal sequence of the amino acid sequence SEQ ID NO: 4.

5. A recombinant derivative of the thermostable DNA polymerase claimed in claim 1, which enzyme has deletions or
additions in the primary structure relative to the amino acid sequence of SEQ ID NO: 2, which modifications do not
destroy the enzymatic properties of the resulting thermostable enzyme, which properties are: 

(a) it catalyzes the combination of nucleoside triphosphates to form a nucleic acid strand complementary to a 
nucleic acid template strand; 

(b) it is able to function efficiently in a polymerase chain reaction with a denaturing temperature up to 100°C;
and 

(c) it comprises a 3' to 5' exonuclease activity.

6. A recombinant derivative of the thermostable DNA polymerase claimed in claim 1, which enzyme has deletions or
additions in the primary structure relative to the amino acid sequence of SEQ ID NO: 4, which modifications do not 
destroy the enzymatic properties of the resulting thermostable enzyme, which properties are: 

(a) it catalyzes the combination of nucleoside triphosphates to form a nucleic acid strand complementary to a 
nucleic acid template strand; 

(b) it is able to function efficiently in a polymerase chain reaction with a denaturing temperature up to 100°C;
and 

(c) it comprises a 3' to 5' exonuclease activity.

7. The thermostable DNA polymerase as claimed in any one of claims 1 to 6, having optimal activity between 70 and
80°C.

8. The thermostable DNA polymerase enzyme as claimed in any one of claims 1 to 7, which is modified by oxidation
or reduction, whereby this modification does not destroy the DNA polymerase activity of the resulting thermostable
enzyme. 

9. A DNA encoding a thermostable DNA polymerase as claimed in any one of claims 1 to 7. 

10. The DNA of claim 9 that has the nucleotide sequence of SEQ ID No. 1, or of a subsequence thereof encoding a 
thermostable DNA polymerase enzyme as claimed in claim 2. 

11. The DNA of claim 9 that has the nucleotide sequence of SEQ ID No. 3, or of a subsequence thereof encoding a
thermostable DNA polymerase enzyme as claimed in claim 4.

12. A recombinant DNA vector that comprises a DNA sequence encoding a thermostable DNA polymerase as claimed 
in any one of claims 1 to 7.

13. A recombinant DNA vector as claimed in claim 12, which is pPoc4 (ATCC No. 69309) or pPab14 (ATCC No. 69310).

14. A host cell transformed with a DNA vector that comprises a DNA sequence encoding a thermostable DNA polymerase
as claimed in any one of claims 1 to 7.

15. A polypeptide displaying Pyrodictium DNA polymerase activity produced in a host cell as claimed in claim 14.

16. A stable enzyme composition comprising a thermostable DNA polymerase as claimed in any one of claims 1 to 7, 
or a polypeptide as claimed in claim 15 in a buffer containing one or more non-ionic polymeric detergents. 

17. A process for the preparation of a thermostable DNA polymerase as claimed in any one of claims 1 to 7, or a 
polypeptide as claimed in claim 15, which process comprises the steps of: 

(a) culturing a host cell transformed with a recombinant DNA vector encoding a thermostable DNA polymerase
as claimed in any one of claims 1 to 7; and

(b) isolating the thermostable DNA polymerase produced in the host cell from the culture. 

18. A process for amplifying a nucleic acid, characterized in that a thermostable DNA polymerase as claimed in any 
one of claims 1 to 7 or a polypeptide as claimed in claim 15 is used. 

19. Use of a thermostable DNA polymerase as claimed in any one of claims 1 to 7 or a polypeptide as claimed in claim
15 for amplifying a nucleic acid.

20. A kit comprising a thermostable DNA polymerase as claimed in any one of claims 1 to 7 or a polypeptide as claimed 
in claim 15 or a stable enzyme composition comprising said polymerase in a buffer containing one or more non-ionic
polymeric detergents, and optionally further reagents useful for performing a PCR reaction such as a set of
primers, probes or nucleoside triphosphate precursors. 

Claims (translated from the German: Patentansprüche)

1. A thermostable DNA polymerase having the amino acid sequence from the amino-terminal end to the carboxy-terminal
end of SEQ ID NO: 2 or SEQ ID NO: 4.

2. An N-terminally shortened fragment of the polymerase according to claim 1 having the amino acid sequence of SEQ
ID NO: 2, but lacking up to one third of the amino-terminal sequence of the amino acid sequence SEQ ID NO: 2.

3. An N-terminally shortened fragment of the polymerase according to claim 2 that lacks exonuclease activity,
preferably an N-terminally shortened fragment selected from the group of

(a) the 366 amino acid deletion mutant having the amino acid sequence from position 1, which is methionine,
immediately followed by amino acids 368 to 803 of SEQ ID NO: 2;

(b) the 386 amino acid deletion mutant having the amino acid sequence from position 1, which is methionine,
immediately followed by amino acids 388 to 803 of SEQ ID NO: 2; or

(c) the 403 amino acid deletion mutant having the amino acid sequence from position 1, which is methionine,
immediately followed by amino acids 405 to 803 of SEQ ID NO: 2.

4. An N-terminally shortened fragment of the polymerase according to claim 1 having the amino acid sequence of SEQ
ID NO: 4, but lacking up to one third of the amino-terminal sequence of the amino acid sequence SEQ ID NO: 4.

5. A recombinant derivative of the thermostable DNA polymerase according to claim 1, wherein the enzyme has deletions
or additions in the primary structure relative to the amino acid sequence of SEQ ID NO: 2, wherein the modifications
do not destroy the enzymatic properties of the resulting thermostable enzyme, the properties being as follows:

(a) it catalyzes the combination of nucleoside triphosphates to produce a nucleic acid strand that is
complementary to a nucleic acid template strand;

(b) it can function efficiently in a polymerase chain reaction with a denaturing temperature of up to 100°C;
and

(c) it comprises a 3' to 5' exonuclease activity.

6. A recombinant derivative of the thermostable DNA polymerase according to claim 1, wherein the enzyme has deletions
or additions in the primary structure relative to the amino acid sequence of SEQ ID NO: 4, wherein the modifications
do not destroy the enzymatic properties of the resulting thermostable enzyme, the properties being as follows:

(a) it catalyzes the combination of nucleoside triphosphates to produce a nucleic acid strand that is
complementary to a nucleic acid template strand;

(b) it can function efficiently in a polymerase chain reaction with a denaturing temperature of up to 100°C;
and

(c) it comprises a 3' to 5' exonuclease activity.

7. A thermostable DNA polymerase according to any one of claims 1 to 6 having an activity optimum between 70 and
80°C.

8. A thermostable DNA polymerase enzyme according to any one of claims 1 to 7 that is modified by oxidation or
reduction, wherein the modification does not destroy the DNA polymerase activity of the resulting thermostable
enzyme.

9. A DNA that encodes a thermostable DNA polymerase according to any one of claims 1 to 7.

10. The DNA according to claim 9 that has the nucleotide sequence of SEQ ID NO: 1, or a subsequence thereof that
encodes a thermostable DNA polymerase enzyme according to claim 2.

11. The DNA according to claim 9 that has the nucleotide sequence of SEQ ID NO: 3, or a subsequence thereof that
encodes a thermostable DNA polymerase enzyme according to claim 4.

12. A recombinant DNA vector that comprises a DNA sequence encoding a thermostable DNA polymerase according to any
one of claims 1 to 7.

13. A recombinant DNA vector according to claim 12, which is pPoc4 (ATCC No. 69309) or pPab14 (ATCC No. 69310).

14. A host cell transformed with a DNA vector that comprises a DNA sequence encoding a thermostable DNA polymerase
according to any one of claims 1 to 7.

15. A polypeptide displaying Pyrodictium DNA polymerase activity that is produced in a host cell according to
claim 14.

16. A stable enzyme composition comprising a thermostable DNA polymerase according to any one of claims 1 to 7 or
a polypeptide according to claim 15 in a buffer that contains one or more non-ionic polymeric detergents.

17. A process for the preparation of a thermostable DNA polymerase according to any one of claims 1 to 7 or of a
polypeptide according to claim 15, wherein the process comprises the following steps:

(a) culturing a host cell transformed with a recombinant DNA vector that encodes a thermostable DNA polymerase
according to any one of claims 1 to 7; and

(b) isolating the thermostable DNA polymerase produced in the host cell from the culture.

18. A process for amplifying a nucleic acid, characterized in that a thermostable DNA polymerase according to any
one of claims 1 to 7 or a polypeptide according to claim 15 is used.

19. Use of a thermostable DNA polymerase according to any one of claims 1 to 7 or of a polypeptide according to
claim 15 for amplifying a nucleic acid.

20. A kit comprising a thermostable DNA polymerase according to any one of claims 1 to 7 or a polypeptide according
to claim 15 or a stable enzyme composition that comprises the polymerase in a buffer containing one or more
non-ionic polymeric detergents, and optionally further reagents useful for carrying out a PCR reaction, such as
a set of primers, probes or nucleoside triphosphate precursors.

Claims (translated from the French: Revendications)

1. A thermostable DNA polymerase having the amino acid sequence running from the amino-terminal end to the
carboxy-terminal end of SEQ ID NO: 2 or of SEQ ID NO: 4.

2. An N-terminal shortened fragment of the polymerase claimed in claim 1 having the amino acid sequence of SEQ ID
NO: 2 but lacking a segment of up to one third of the amino-terminal sequence of the amino acid sequence SEQ ID
NO: 2.

3. An N-terminal shortened fragment of the polymerase as claimed in claim 2 lacking exonuclease activity,
preferably an N-terminal shortened fragment selected from the group consisting of

(a) the 366 amino acid deletion mutant having the amino acid sequence beginning at position 1, which is
methionine, immediately followed by amino acids 368 to 803 of SEQ ID NO: 2;

(b) the 386 amino acid deletion mutant having the amino acid sequence beginning at position 1, which is
methionine, immediately followed by amino acids 388 to 803 of SEQ ID NO: 2; or

(c) the 403 amino acid deletion mutant having the amino acid sequence beginning at position 1, which is
methionine, immediately followed by amino acids 405 to 803 of SEQ ID NO: 2.

4. An N-terminal shortened fragment of the polymerase claimed in claim 1 having the amino acid sequence of SEQ ID
NO: 4 but lacking a segment of up to one third of the amino-terminal sequence of the amino acid sequence SEQ ID
NO: 4.

5. A recombinant derivative of the thermostable DNA polymerase claimed in claim 1, which enzyme has deletions or
additions in the primary structure relative to the amino acid sequence of SEQ ID NO: 2, these modifications not
destroying the enzymatic properties of the resulting thermostable enzyme, these properties being the following:

(a) it catalyzes the combination of nucleoside triphosphates to form a nucleic acid strand complementary to
the nucleic acid template strand;

(b) it is able to function efficiently in a polymerase chain reaction with a denaturing temperature of up to
100°C;

(c) it comprises a 3' to 5' exonuclease activity.

6. A recombinant derivative of the thermostable DNA polymerase claimed in claim 1, which enzyme has deletions or
additions in the primary structure relative to the amino acid sequence of SEQ ID NO: 4, these modifications not
destroying the enzymatic properties of the resulting thermostable enzyme, these properties being the following:

(a) it catalyzes the combination of nucleoside triphosphates to form a nucleic acid strand complementary to
a nucleic acid template strand;

(b) it is able to function efficiently in a polymerase chain reaction with a denaturing temperature of up to
100°C;

(c) it comprises a 3' to 5' exonuclease activity.

7. A thermostable DNA polymerase according to any one of claims 1 to 6, having optimal activity between 70 and
80°C.

8. A thermostable DNA polymerase enzyme according to any one of claims 1 to 7, which is modified by oxidation or
reduction, this modification not destroying the DNA polymerase activity of the resulting thermostable enzyme.

9. A DNA encoding a thermostable DNA polymerase according to any one of claims 1 to 7.

10. The DNA of claim 9 that has the nucleotide sequence of SEQ ID NO: 1, or of one of its subsequences encoding a
thermostable DNA polymerase enzyme according to claim 2.

11. The DNA of claim 9 that has the nucleotide sequence of SEQ ID NO: 3, or of one of its subsequences encoding a
thermostable DNA polymerase enzyme according to claim 4.

12. A recombinant DNA vector that comprises a DNA sequence encoding a thermostable DNA polymerase according to any
one of claims 1 to 7.

13. A recombinant DNA vector according to claim 12, which is pPoc4 (ATCC No. 69309) or pPab14 (ATCC No. 69310).

14. A host cell transformed with a DNA vector that comprises a DNA sequence encoding a thermostable DNA polymerase
according to any one of claims 1 to 7.

15. A polypeptide displaying Pyrodictium DNA polymerase activity produced in a host cell according to claim 14.

16. A stable enzyme composition comprising a thermostable DNA polymerase according to any one of claims 1 to 7, or
a polypeptide according to claim 15, in a buffer containing one or more non-ionic polymeric detergents.

17. A process for preparing a thermostable DNA polymerase according to any one of claims 1 to 7, or a polypeptide
according to claim 15, which process comprises the steps of:

(a) culturing a host cell transformed with a recombinant DNA vector encoding a thermostable DNA polymerase
according to any one of claims 1 to 7; and

(b) isolating the thermostable DNA polymerase produced in the host cell from the culture.

18. A process for amplifying a nucleic acid, characterized in that a thermostable DNA polymerase according to any
one of claims 1 to 7 or a polypeptide according to claim 15 is used.

19. Use of a thermostable DNA polymerase according to any one of claims 1 to 7 or of a polypeptide according to
claim 15 for amplifying a nucleic acid.

20. A kit comprising a thermostable DNA polymerase according to any one of claims 1 to 7 or a polypeptide according
to claim 15, or a stable enzyme composition comprising said polymerase in a buffer containing one or more
non-ionic polymeric detergents, and optionally other reagents useful for carrying out a PCR reaction, such as a
set of primers, probes or nucleoside triphosphate precursors.